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TRANSCYTOSIS OF ADENO- ASSOCIATED VIRUSES 



CROSS-REFERENCE TO RELATED APPLICATIONS 

This claims the benefit of U.S. Provisional Application No. 60/607,854, entitled 
"Transcytosis of Adeno- Associated Viruses", filed September 8, 2004, by Chiorini et al, 
10 which is herein incorporated by reference in its entirety. 

BACKGROUND 

The adeno-associated viruses (AAV) were originally classified according to size, 
structure, and dependence upon a helper virus for replication. AAV is a member of the 
Parvoviridae, a virus family characterized by a single stranded linear DNA genome and a 

15 small icosahedral shaped capsid measuring about 20nm in diameter. AAV was first 

described as a contaminant of tissue culture grown simian virus 15, a simian adeno virus 
and was found dependent on adenovirus for measurable replication. This led to its name, 
adeno-associated virus, and its classification in the genus Dependovirus. Because the 
majority of AAV isolates were first identified as contaminants of laboratory stocks of 

20 adenovirus, little is known about their natural tissue tropism. However in vivo experiments 
suggest they are effective vectors for gene transfer applications. Currently eleven full-length 
isolates have been cloned and their initial characterization indicates that each serotype has 
unique binding/cell tropism characteristics. 

Transcytosis is the transport of macromolecular cargo from one side of a cell to the 

25 other within membrane-bounded carriers). It is a strategy used by multicellular organisms 
to selectively move material between two different environments while maintaining the 
distinct compositions of those environments. The ability of a pathogen to spread through a 
tissue is a critical determinate of its virulence. The process of transcytosis has been reported 
for a number of viruses. For example, HIV and poliovirus cross simple epithelial cells 

30 without infection and are still infectious when they cross into the submucosa. Likewise, the 
Epstein-Barr virus (EBV) forms a complex with mucosal immunoglobulins (IgA) that are 
specific for gp350, a viral surface protein that is present in latently infected people. This 
complex binds to the poly-immunoglobulin receptor at the basal surface of epithelial cells, 
and is endocytosed and delivered apically without infection. To date, there is no report of 

3 5 transcytosis by any AAV. 

1 
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Provided herein are methods for transcytosis across barrier epithelial cells using 
AAV vectors. The ability of a non-pathogenic vector to transcytose barrier epithelial cells 
can be used to deliver genes to sub-epithelial targets. One important example includes the 
delivery of genes across the blood-brain-barrier without the need for direct injection into the 
10 brain. Furthermore, herein is described a method for re-directing virus that enters a cell by 
transcytosis to result in transduction of the cell by blocking exocytosis. 

SUMMARY 

In accordance with the purpose(s) of this invention, as embodied and broadly 
described herein, this invention, in one aspect, relates to a method of delivering a 
15 heterologous nucleic acid across an epithelial barrier comprising delivering to the epithelial 
barrier an AAV vector comprising the heterologous nucleic acid. The epithelial cells can be 
in the gut, lung, genitourinary tract, kidney, blood vessels or brain. 

In another aspect, the invention relates to a method of transcytosing epithelial cells 
of a human subject comprising administering to the subject a viral vector comprising a 
20 heterologous nucleic acid, wherein the viral vector is selected from a group consisting of 
BAAV, AAV4, AAV5, or AAV7. 

Additional advantages of the invention will be set forth in part in the description 

■ 

which follows, and in part will be obvious from the description, or may be learned by 
practice of the invention. The advantages of the invention will be realized and attained by 
25 means of the elements and combinations particularly pointed out in the appended claims. It 
is to be understood that both the foregoing general description and the following detailed 
description are exemplary and explanatory only and are not restrictive of the invention, as 
claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

30 The accompanying drawings, which are incorporated in and constitute a part of this 

specification, illustrate (one) several embodiment(s) of the invention and together with the 
description, serve to explain the principles of the invention. 

Figure 1 shows that AAV4 transcytosed in CaCo-2, MDCKI, MDCKII, Human 
primary immortalized epithelial endometrial, Bovine brain primary endothelia cells (BBB). 

35 AAV5 transcytosed CaCo-2 cells, whereas BAAV transcytosed in MDCKs, Endometrial, 
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5 airways epithelia, and BBB. AAV6 did not transcytose in any of cell types tested. Hela cells 

do not form barrier epithelia and were used as a control. 

Figure 2 shows that the treatment of the basal lateral surface of Human primary 

airways epithelial cell (HAE) with tannic acid blocked the transcytosis of B AAV vector 

containing a GFP expression cassette from the apical surface to the basal lateral. 
1 0 Furthermore transduction dramatically increased when assayed at 24 hrs post inoculation. 

In contrast no change was observed in AAV2 transduction, which did not demonstrate any 

transcytosis activity and has limited binding activity on HAE. 

Figure 3 shows AAV7 transcytosis assay on bovine brain endothelial cells. Virus 

DNA extracted from basal lateral medium after 3H incubation 2xl0 9 DRP of AAV were 
15 loaded on the apical side of the cell layer. AAV5 is used as a control. 

DETAILED DESCRIPTION 

The present invention may be understood more readily by reference to the following 
detailed description of the invention and the Examples included therein and to the Figures 
and their previous and following description. 
20 Before the present compounds, compositions, articles, devices, and/or methods are 

disclosed and described, it is to be understood that this invention is not limited to specific 
synthetic methods, specific cell types, or to particular tissues, as such may, of course, vary. 
It is also to be understood that the terminology used herein is for the purpose of describing 

* 

particular embodiments only and is not intended to be limiting. 
25 As used in the specification and the appended claims, the singular forms "a," "an" 

and "the" include plural referents unless the context clearly dictates otherwise. Thus, for 
example, reference to "a pharmaceutical carrier" includes mixtures of two or more such 

carriers, and the like. 

Ranges may be expressed herein as from "abouf ' one particular value, and/or to 

30 "about" another particular value. When such a range is expressed, another embodiment 
includes from the one particular value and/or to the other particular value. Similarly, when 
values are expressed as approximations, by use of the antecedent "about," it will be 
understood that the particular value forms another embodiment. It will be further understood 
that the endpoints of each of the ranges are significant both in relation to the other endpoint, 

35 and independently of the other endpoint. 



3 



It 



WO 2006/029196 PCT7US2005/031837 

5 "Optional" or "optionally" as used herein means that the subsequently described 

event or circumstance may or may not occur, and that the description includes instances 
where said event or circumstance occurs and instances where it does not. 

AAV Transcytosis 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

1 0 barrier comprising delivering to the epithelial barrier an AAV vector comprising the 

heterologous nucleic acid, hi one aspect of the method, the AAV is AAV4, AAV5, AAV7, 
or BAAV. The AAV capsid protein forming the viral particle is understood herein to confer 
upon the AAV particle the desired transcytosing ability. Thus, "AAV vector", as used 
herein, refers to any virion, vector, or viral particle comprising or encoding at least one 

1 5 AAV capsid protein. As an example, an AAV4 vector can encode an AAV4 capsid protein 
and thus be encapsidated in said protein forming an AAV4 particle. Alternatively the AAV 
vector can comprise a nucleic acid encoding a modified AAV or a portion of an AAV capsid 
protein (a capsid protein fragment) that confers serotype-specific trancytotic activity. AAV 
capsids, capsid protein fragments and capsid modifications are disclosed, for example, in 

20 U.S. Patent Application No. 60/526786 (BAAV), U.S. Patent No. 6,468,524 (AAV4), U.S. 
Patent Application No. 09/717,789 (AAV5), U.S. Patent Application 2003/0228282 
(AAV7), International Application No. PCT/US04/15534, filed May 19, 2004 (AAAV), and 
U.S. Patent Application No. 60/676604, filed April 29, 2005 (AAV-X1, AAV-Xlb, AAV- 
X5, AAV-X19, AAV-X21, AAV-X22, AAV-X23, AAV-X24, AAV-X25, AAV-X26). 

25 In another aspect of the method, the epithelial cells are in the gut, lung, genitourinary 

tract, kidney, blood vessels or brain. In another aspect of the method, the epithelial cells can 
be selected from a group consisting of bronchial, alveolar, tracheal or upper airway 
epithelial cells; absorptive enterocytes or M cells; endometrial or urinary epithelial cells; 
renal collecting duct or proximal tubule epithelial cells; cerebral microvascular endothelial 

30 cells or Choroidal Plexus epithelial cells. 

Further disclosed is a method of transcytosing epithelial cells of a human subject 
comprising administering to the subject an AAV vector comprising a heterologous nucleic 
acid. In one aspect of the method, the vector is AAV4, AAV5, AAV7, or BAAV. In another 
aspect of the method, the epithelial cells are selected from a group consisting of bronchial, 

35 alveolar, tracheal or upper airway epithelial cells; absorptive enterocytes or M cells; 
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5 endometrial or urinary epithelial cells; renal collecting duct or proximal tubule epithelial 
cells; cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells. 

Further contemplated are methods for the delivery of molecules across epithelial cell 
barriers comprising coupling the molecules to non-recombinant (wild-type) AAV capsids or 
particles. In one aspect, the molecules are radioligands or enzymes. 

1 o The term "adeno-associated virus (AAV)" is used herein to refer to a genus of 

viruses in the family Parvoviridae which are all defective viruses (unable to replicate by 
themselves) and depend on the co-infection of their host cell by other, nondefective viruses 

» 

to help them replicate. 

Transcytosis refers to the transport of macromolecular cargo from one side of a cell 

15 to the other, generally within a membrane-bounded carriers). Tuma and Hubbard provided 
a review of transcytosis (Tuma PL and Hubbard AL. 2003. Physiol Rev. 83:871-932), herein 
incorporated by reference for its teaching regarding the nature and uses for trancytosis. 
Transcytosis is a strategy used by multicellular organisms to selectively move material 
between two different environments while maintaining the distinct compositions of those 

20 environments. N. Simionescu was the first to coin the term transcytosis to describe the 
vectorial transfer of macromolecular cargo within the plasmalemmal vesicles from the 
* circulation across capillary endothelial cells to the interstitium of tissues. During this same 
period, another type of transcytosis was being discovered. Lnmunologists comparing the 
different types of immunoglobulins found in various secretions (e.g., serum, milk, saliva, 

25 and the intestinal lumen) speculated that the form of IgA found in external secretions (called 
secretory IgA, due to the presence of an additional protein component) was selectively 
transported across the epithelial cell barrier. More is known about transcytosis as it is 
expressed in epithelial tissues, which form cellular barriers between two environments. In 
this polarized cell type, net movement of material can be in either direction, apical to 

30 basolateral or the reverse, depending on the cargo and particular cellular context of the 
process. However, transcytosis is not restricted to only epithelial cells. 

Since the 19th century dye experiments of Ehrlich, the brain has been known as a 
''privileged" organ where access is tightly regulated so that the environment remains 
chemically stable. The two principal gatekeepers of the brain are the cerebral capillary 

35 endothelium and the cuboidal epithelial cells of the choroid plexus. These cellular barriers 
are specialized for the passage of different nutrients from the blood. The capillaries move 
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5 nutrients that are required rapidly and in large quantities, such as glucose and amino acids. 
These small molecules are transported by membrane carriers using facilitated diffusion. The 
choroid plexus supplies nutrients that are required less acutely and in lower quantities. 
These are folate and other vitamins, ascorbate, and deoxyribonucleotides. 

There are two epithelial cells that participate in transcytosis in the intestine, M cells 

10 and enterocytes (adsorptive columnar cells). These cells are very different from one another 
and the capillary endothelial cell. Depending on the species, M cells comprise a variable but 
small percentage of the epithelia overlying organized mucosal-associated lymphoid tissue, 
making them a very minor cell population in the gastrointestinal tract. The transcytotic 
route across M cells is thought to be part of the mechanism by which antigens are routinely 

1 5 sampled along the entire mucosal surface. Not surprisingly, numerous pathogens have 
evolved mechanisms to exploit the transcytotic process as a means to invade and 
disseminate before a strong enough immune response can be mounted. 

Absorptive enterocytes are simple columnar cells with several apical features in 
addition to their brush borders. Clathrin-coated pits are present at the base of microvilli, and 

20 a thick glycocalyx composed of integral membrane proteins with glycosaminoglycan side 
chains emanates from the microvillar membrane. This latter structural feature as well as the 
rigidity of the microvilli are thought to prohibit microorganisms from attaching and 
invading enterocytes. The intracellular organization of these columnar epithelial cells is also 
polarized, with basally located nuclei, supranuclear Golgj, and an abundance of 

25 pleomorphic membrane compartments underlying the terminal web of the brush border. 
The basolateral-to-apical length of this cell is -20 versus 0.2 /an for a capillary endothelial 
cell, making the transcytotic route across enterocytes potentially much longer. Furthermore, 
inicrotubules are an important structural element of the transcytotic pathway in enterocytes, 
but not in M or endothelial cells. 

30 Transcytosis also occurs in the upper regions of the respiratory tract and has been 

demonstrated with two vector systems, plgA-R and FcRn, but others could exist. Secretory 
IgA is a known constituent of the lung's immune defense system, with bronchial epithelial 
cells carrying out basolateral-to-apical transport of dlgA, which is secreted by local plasma 
cells in underlying lymphoid tissue. Albumin, which is found in lung fluid, is endocytosed 

35 specifically at the apical surface of airway epithelia but is then subsequently degraded. At 
the alveolar level, me question of whether albumin is transcytosed intact is uncertain. 
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5 The methods and compositions described herein can be used to deliver heterologous 

nucleic acids to certain tissues. As used herein, the term "nucleic acid" refers to single-or 
multiple stranded molecules which may be DNA or RNA, or any combination thereof, 
including modifications to those nucleic acids. The nucleic acid may represent a coding 
strand or its complement, or any combination thereof. Nucleic acids may be identical in 

1 0 sequence to the sequences which are naturally occurring for any of the novel genes 

discussed herein or may include alternative codons which encode the same amino acid as 
those provided herein, including that which is found in the naturally occurring sequence. 
These nucleic acids can also be modified from their typical structure. Such modifications 
include, but are not limited to, methylated nucleic acids, the substitution of a non-bridging 

1 5 oxygen on the phosphate residue with either a sulfur (yielding phosphorothioate 

deoxynucleotides), selenium (yielding phosphorselenoate deoxynucleotides), or methyl 
groups (yielding methylphosphonate deoxynucleotides). 

As used herein, the term "isolated" refers to a nucleic acid separated or significantly 
free from at least some of the other components of the naturally occurring organism, for 

20 example, the cell structural components or viral components commonly found associated 
with nucleic acids in the environment of the virus and/or other nucleic acids. The isolation 
of the native nucleic acids can be accomplished, for example, by techniques such as cell 
lysis followed by phenol plus chloroform extraction, followed by ethanol precipitation of the 
nucleic acids. The nucleic acids of this invention can be isolated from cells according to any 

25 of many methods well known in the art. 

The AAV vectors disclose herein can comprise a heterologous nucleic acid 
functionally linked to the promoter. The term "heterologous" is used herein to refer to a 
nucleic acid which is derived from a different cell, tissue or organism. The nucleic acid can 
encode a polypeptide or protein or an antisense RNA, for example. By "functionally linked" 

30 is meant such that the promoter can promote expression of the heterologous nucleic acid, as 
is known in the art, such as appropriate orientation of the promoter relative to the 
heterologous nucleic acid. Furthermore, the heterologous nucleic acid preferably has all 
appropriate sequences for expression of the nucleic acid, as known in the art, to functionally 
encode, i.e., allow the nucleic acid to be expressed The nucleic acid can include, for 

35 example, expression control sequences, such as an enhancer, and necessary information 
processing sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites, 
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5 and transcriptional terminator sequences. 

The heterologous nucleic acid can encode beneficial proteins that replace missing or 
defective proteins required by the subject into which the vector in transferred or can encode 
a cytotoxic polypeptide that can be directed, e.g., to cancer cells or other cells whose death 
would be beneficial to the subject. The heterologous nucleic acid can also encode antisense 

10 RNAs that can bind to, and thereby inactivate, mRNAs made by the subject that encode 
harmful proteins. In one embodiment, antisense polynucleotides can be produced from a 
heterologous expression cassette in an AAV4 viral construct where the expression cassette 
contains a sequence that promotes cell-type specific expression (Wirak et al., 1991. EMBO 
10:289). For general methods relating to antisense polynucleotides, see Antisense RNA and 

15 DNA, D. A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1988). 

Examples of heterologous nucleic acids which can be administered to a cell or 
subject as part of the present AAV4 vector can include, but are not limited to the following: 
nucleic acids encoding therapeutic agents, such as tumor necrosis factors (TNF), such as 
TNF-a; interferons, such as interferon-a, interferon-^ and interferon-y, interleukins, such as 

20 IL-1, IL-l/S, and ILs -2 through -14; GM-CSF; adenosine deaminase; cellular growth factors, 
such as lymphokines; soluble CD4; Factor Vffl; Factor IX; T-cell receptors; LDL receptor; 
ApoE; ApoC; alpha-1 antitrypsin; ornithine transcarbamylase (OTC); cystic fibrosis 
transmembrane receptor (CFTR); insulin; Fc receptors for antigen binding domains of 
antibodies, such as immunoglobulins; and antisense sequences which inhibit viral 

25 replication, such as antisense sequences which inhibit replication of hepatitis B or hepatitis 
non-A, non-B virus. The nucleic acid is chosen considering several factors, including the 
cell to be transfected. Where the target cell is a blood cell, for example, particularly useful 
nucleic acids to use are those which allow the blood cells to exert a therapeutic effect, such 
as a gene encoding a clotting factor for use in treatment of hemophilia. Furthermore, the 

30 nucleic acid can encode more than one gene product, limited only, if the nucleic acid is to be 
packaged in a capsid, by the size of nucleic acid that can be packaged. 

The term "polypeptide" as used herein refers to a polymer of amino acids and 
includes full-length proteins and fragments thereof. Thus, "protein," polypeptide," and 
"peptide" are often used interchangeably herein. Substitutions can be selected by known 

35 parameters to be neutral (see, e.g., Robinson WE Jr, and Mitchell WM, 1990. AIDS 

4:S151-S162). As will be appreciated by those skilled in the art, the invention also includes 

8 
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5 those polypeptides having slight variations in amino acid sequences or other properties. 
Such variations may arise naturally as allelic variations (e.g., due to genetic polymorphism) 
or may be produced by human intervention (e.g., by mutagenesis of cloned DNA 
sequences), such as induced point, deletion, insertion and substitution mutants. Minor 
changes in amino acid sequence are generally preferred, such as conservative amino acid 

10 replacements, small internal deletions or insertions, and additions or deletions at the ends of 
the molecules. Substitutions may be designed based on, for example, the model of Dayhofi^ 
et al (in Atlas of Protein Sequence and Structure 1978, Nat'l Biomed. Res. Found., 
Washington, D.C.). These modifications can result in changes in the amino acid sequence, 
provide silent mutations, modify a restriction site, or provide other specific mutations. 

1 5 The term "epithelia" is used herein to refer to cells which are linked tightly together 

by intercellular junctions to form a planar sheet. These sheets of cells form a barrier between 
two compartments. Epithelia therefore line all surfaces and cavities (including skin, 
peritoneum, linings of the intestine, airways, genitourinary tracts, glands, and blood vessels. 
An epithelium has a free or apical surface facing the environment, or lumen of a 

20 cavity, and a basal surface facing the underlying connective tissue. The boundary between 
the basal surface of an epithelium and the underlying connective tissue is usually very sharp, 
and is the site where the basal lamina (BL) is present. Most BL are too thin to be seen with 
the light microscope. However, the BL, together with a thin layer of connective tissue, is 
often times seen at the epitheUal/connective tissue interface. This composite layer, visible 

25 with the light microscope, was initially called the Basement Membrane. Application of the 
electron microscope revealed that, in most cases, this Basement Membrane actually 
consisted of the true basal lamina (lamina lucida plus lamina densa), along with a layer of 

adherent connective tissue. 

For convenience of description, epithelia are classified into different types based on 

30 the number of cell layers and the cell shape. 

Epithelia which are 1 cell layer thick are called "simple" epithelia. Thus, each cell 
rests on the basal lamina, but also has a surface facing the lumen/outside world. Epithelia 
which are 2 or more cell layers thick are called "stratified" epithelia. In stratified epithelia, 
the basal layer of cells rests on the basal lamina, but subsequent layers do not, and are 

35 simply stacked on top of the basal layer. The cells of the most superficial layer have a free 
surface, "squamous" cells are very flat, like a fried egg, where the yolk is the nucleus. The 
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5 nucleus is distinctly flattened, the cell is often so thin that this flattened nucleus bulges the 
cell surface outward, "cuboidal" cells range from true cuboidal where the cell is about as 
high as it is wide, to a flattened cuboidal where the cell is wider than high. In cuboidal cells 
the nucleus is usually round, and not flattened as in squamous, "columnar" cells are 2 or 
more times as high as wide. Nucleus is usually elongated in the long axis of the cell. 

1 0 Squamous cells form the lining of cavities such as the mouth, blood vessels, heart 

and lungs and make up the outer layers of the skin. Cuboidal epithelium is found in glands 
and in the lining of the kidney tubules as well as in the ducts of the glands. They also 
constitute the germinal epithelium which produces the egg cells in the female ovary and the 
speim cells in the male testes. Columnar epithelium forms the lining of the stomach and 

1 5 intestines. Some columnar cells are specialized for sensory reception such as in the nose, 
ears and the taste buds of the tongue. 

Ciliated columnar epithelial cells posses fine hair-like outgrowths, cilia on their free 
surfaces. These cilia are capable of rapid, rhythmic, wavelike beatings in a certain direction. 
Ciliated epithelium is usually found in the air passages like the nose. It is also found in the 

20 uterus and Fallopian tubes of females. 

Columnar epithelium with goblet cells is called glandular epithelium. Some parts of 
the glandular epithelium consist of such a large number of goblet cells that there are only a 
few normal epithelial cells left. Columnar and cuboidal epithelial cells often become 
specialized as gland cells which are capable of synthesizing and secreting certain substances 

25 such as enzymes, hormones, milk, mucus, sweat, wax and saliva. Unicellular glands consist 
of single, isolated glandular cells such as the goblet cells. Sometimes a portion of the 
epithelial tissue becomes invaginated and a multicellular gland is formed. Multicellular 
glands are composed of clusters of cells. Most glands are multicellular including the salivary 
glands. 

30 Where body linings have to withstand wear and tear, the epithelia are composed of 

several layers of cells and are then called compound or stratified epithelium. The top cells 
are flat and scaly and it may or may not be keratinized (i.e. containing a tough, resistant 
protein called keratin). The mammalian skin is an example of dry, keratinized, stratified 
epithelium. The lining of the mouth cavity is an example of an unkeratinized, stratified 

35 epithelium. 
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5 In vitro Cell Models of Transcytosis 

The use of in vitro cell models to study transcytosis has many advantages over in 
vivo systems. First, variation among animals is eliminated, as is the confounding issue of 
cargo possibly being modified or endocytosed by cell types other than the one under study. 
Moreover, in vitro systems can be manipulated in ways not possible in vivo, allowing 
1 0 investigators to measure the effects of different variables (e.g., temperatures, 

pharmacological agents, etc.) with greater precision and to explore the molecular 

mechanisms of transcytosis. 

The integrity of the monolayer is obviously vital to every study of transcytosis, and 
there are different methods for assessing it. TransepitheHal electrical resistance (TER) 

1 5 measurements are commonly used as an indication of tight junction integrity in a monolayer, 
and commercial instruments are available for these measurements. 

Caco-2 cells, human primary colon carcinoma cells, are a well studied model of 
intestinal absoiptive enterocytes. They are the most commonly used intestinal cell line 
because they differentiate furthest along the cryptto-villus axis and are the easiest to 

20 transfect Caco-2 cells have been especially used to model transcytosis of bacteria, which 
can cross barrier epithelia in the gut and brain (Zhang JR, et al., 2000. Cell 102(6):827-37), 
incorporated herein by reference. 

There is little evidence for in vivo transcytosis of macromolecular cargo in kidney. 
Nonetheless, MDCK cells, which are derived from dog kidney, are the most-studied 

25 epithelial cell model and have been used extensively to study transcytosis. These cells were 
originally developed by nephrologists for permeability and electrical studies. Their 
subsequent use by cell biologists for studies of the formation of tight junctions, 
establishment of polarity, and vesicle traffic have popularized MDCK cells. An advantage is 
that MDCK cells are easily cultured, easily transfected, and become polarized 3-5 days after 

30 seeding. They were used in the now classical studies showing that enveloped viruses bud in 
a polarized fashion and that the newly synthesized viral membrane glycoproteins are 
targeted directly from the TGN to the appropriate PM domain. Furthermore, much of the 
current understanding of the IgA transcytotic pathway and the sorting signals in the plgA-R 
comes from the elegant studies performed in MDCK cells. Two MDCK strains with very 

35 different features were identified some time ago. The MDCK I cell has a high TER and 
characteristics reminiscent of the renal collecting duct, whereas the more commonly used 
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5 MDCK II strain, whose TER is one order of magnitude lower than that of MDCK I cells, 

has phenotypic features closer to those of the renal proximal tubule. 

Both primary cells and cell lines, alone and in coculture with endothelial cells, are 

being used to study transcytosis in the lung. Clonetics bronchial/tracheal epithelial cell 

systems contain normal human bronchial/treacheal epithelial cells. This cell system has been 
10 used for experimental applications in cancer research, respiratory disease, cellular function 

and differentiation. 

The Clonetics® bovine Brain Microvascular Endothelial Cell System (bMVEC-B) is 
a model of the "Blood Brain Barrier". The system is designed to significantly improve a 
researcher's ability to study active and passive transport of drugs across the blood brain 

15 barrier, to study brain endothelial cell tight junctions, and to study the basic biology of brain 
microvascular endothelial cells (Schinket AH , 1 999. Advanced Drug Delivery Reviews 
36:179-194; Tsukita S. et al., 1998. Molecular dissection of tight junctions:occluding and 
ZO-1 in Introduction to the Blood -Brain Barrier. Edited by William M Partridge; Inglis et 
al., 2004. Brain Research 998: 218-229), each of which is incorporated by reference for its 

20 teaching of in vitro endothelial cell modeling of the blood-brain barrier. 

Endometrial cells form an important barrier layer in the genitourinary tract. The 
cells used to model this system were developed by Kyo et al. and are derived from primary 
cells immortalized by the addition of the papillomiavirus E6/E7 genes and human 
telomerase reverse transcriptase. The isolated cells have a normal chromosomes and retain 

25 their responsiveness to sex-steriod hormones, exhibit glandular structure on three 

dimensional culture , and lack a transformed phenotype (Kyo S, et al. Am J Pathol., 2003. 
163(6):2259-69), incorporated herein by reference for its teaching of this endometrial 
model. 

Methods of Use 

30 The use of AAVs to deliver genes to the lung would be of benefit in genetic diseases 

like cystic fibrosis, pseudohypoaldosteronism, and immotile cilia syndrome. Furthermore, 
delivering genes to the lung would be of impact in several non-genetic diseases. For 
example, delivering genes that make antibiotic like peptides to the cells underlying the 
epithelia would be useful to prevent or treat bronchitis; delivering genes that make growth 

35 factors would be of value in common diseases like chronic bronchitis. Also, AAVs could be 
used to deliver genes that may play a role in asthma, like IL-10, or antibodies to IgE and 
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5 interleuktas. The use of an AAV vector to deliver genes through the alveolar epithelia 
would be of benefit in genetic diseases like alpha- 1 -antitrypsin deficiency. Furthermore, 
delivering genes through the alveolar epithelia would be of significance in several 
pulmonary non-genetic diseases. For example, delivering genes that make antibiotic like 
peptides would be useful to prevent or treat pneumonia (perhaps of antibiotic-resistant 

10 organisms); delivering genes that make growth factors would be of value in emphysema; 
delivering genes thatover-express the epithelial sodium channel or the Na-K ATPase could 
be used to treat cardiogenic and non-cardiogenic pulmonary edema; delivering genes that 
have an anti-fibrosis effect like interferon for pulmonary fibrosis would also be useful. Also, 
AAVs could be used to deliver genes that may have a systemic effect like anti-hypertension 

15 drugs, insulin, coagulation factors, antibiotics, growth factors, hormones and others. 

The use of AAVs to deliver genes to the central nervous system (CNS)/ brain would 
be of benefit in neurological diseases, including Alzheimer f s Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia, paranoia, : 
obsessive compulsive disorder, panic disorder, learning disabilities, ALS, triplet expansions 

20 diseases, psychoses, autism, lysosomal storage diseases, Gaucher' s disease, Hurler's 

disease, Krabbe's disease, battens disease, and altered behaviors (e.g., disorders in feeding, 
sleep patterns, balance, and perception). 

The use of AAVs to deliver genes to the gastrointestinal system/ gut would be of 
benefit in treatment of diseases and/or Gastrointestinal Disorders such as colon cancers, 

25 inflammatory bowel disease, diabetes, or Crohn's disease. 

The use of AAVs to deliver genes to the genitourinary system would be of benefit in 
treatment of diseases of the female reproductive tract, molecular defects in implantation 
disorders, and gynecological cancers. These methods would also have contraceptive 

* 

applications. 

30 The use of AAVs to deliver genes to the kidney would be of benefit in treatment of 

inherited renal disorders such as polycystic kidney disease, Alport's syndrome, hereditary 

nephritis, primary hyperoxaluria, and cystinuria. 

The use of AAVs for wide-spread delivery of genes across blood vessels into the 

muscle would be of benefit in neuromuscular diseases like muscular dystrophy and 
35 Cardiovascular Disorders such as heart disease, restenosis, atherosclerosis, myocarditis, 

stoke, angina, or thrombosis. 
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5 The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 

tissues of a subject would be of benefit in the treatment of certain cancers (e.g., gastric, 

ovarian, lung, bladder, liver, and breast). 

The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 
tissues of a subject would be of benefit in the treatment of certain inflammatory disorders, 

10 including, but not limited to, adrenalitis, alveolitis, angiocholecystitis, appendicitis, 
balanitis, blepharitis, bronchitis, bursitis, carditis, cellulitis, cervicitis, cholecystitis, 
chorditis, cochlitis, colitis, conjunctivitis, cystitis, dermatitis, diverticulitis, encephalitis, 
endocarditis, esophagitis, eustachitis, fibrositis, folliculitis, gastritis, gastroenteritis, 
gingivitis, glossitis, hepatosplemtis, keratitis, labyrinthitis, laryngitis, lymphangitis, mastitis, 

1 5 media otitis, meningitis, metritis, mucitis, myocarditis, myosititis, myringitis, nephritis, 
neuritis, orchitis, osteochondritis, otitis, pericarditis, peritendonitis, peritonitis, pharyngitis, 
phlebitis, poliomyelitis, prostatitis, pulpitis, retinitis, rhinitis, salpingitis, scleritis, 
sclerochoroiditis, scrotitis, sinusitis, spondylitis, steatitis, stomatitis, synovitis, syringitis, 
tendonitis, tonsillitis, urethritis, and vaginitis; and disorders that are characterized by 

20 inflammation such as hepatitis, rheumatoid arthritis, gout, trauma, pancreatitis, sarcoidosis, 
dermatitis, renal ischemia-reperfusion injury. Grave's disease, systemic lupus erythematosus, 
diabetes mellitus, and allogenic transplant rejection. 

The use of AAVs for wide-spread delivery of genes across blood vessels into any/all 
tissues of a subject would be of benefit in the treatment of other diseases, syndromes and 

25 conditions, such as adenosine deaminase deficiency, sickle cell deficiency, thalassemia, 

hemophilia, diabetes, phenylketonuria, growth disorders, and defects of the immune system. 

BAAV 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier of the lung, comprising delivering to the lung a BAAV vector comprising the nucleic 
30 acid. In one aspect of the method, the epithelial barrier comprises human bronchial, 

alveolar, tracheal or upper airway epithelial cells. Thus, disclosed is a method of delivering 
a heterologous nucleic acid across human airway epithelial cells, comprising delivering to 
the cells a BAAV vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
35 barrier in the brain, comprising delivering to the brain a BAAV vector comprising the 
nucleic acid In one aspect of the method, the epithelial barrier comprises human cerebral 
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5 microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 

barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 
cerebral microvascular endothelial cells, comprising delivering to the cells a B AAV vector 

comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across the epithelial 

1 0 barrier of blood vessels into the muscle, comprising delivering to the blood stream a B AAV 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human vascular endothelial cells. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the genitourinary tract, comprising delivering to the genitourinary tract a B AAV 

1 5 vector comprising the nucleic acid genitourinary tract. In one aspect of the method, the 

epithelial barrier comprises human endometrial or urinary epithelial cells. Thus, disclosed is 
a method of delivering a heterologous nucleic acid across human endometrial epithelial 
cells, comprising delivering to the cells a B AAV vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an- epithelial 

20 barrier in the kidney, comprising delivering to the genitourinary tract a B AAV vector 

comprising the nucleic acid genitourinary tract. In one aspect of the method, the epithelial 
barrier comprises human renal collecting ducts or proximal tubules. Thus, disclosed is a 
method of delivering a heterologous nucleic acid across human kidney epithelial cells, 
comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

25 Disclosed is amethod of transcytosing lung epithelial cells of a subject comprising 

contacting the lung epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
bronchial, tracheal, or upper airway epithelial cells. 

Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 

30 contacting the CNS epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

Disclosed is a method of transcytosing vascular epithelial cells of a subject 
35 comprising contacting the vascular epithelial cells of the subject with a BAAV vector 
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5 comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
human vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the genitourinary tract epithelial cells of the subject with a BAAV 
vector comprising a heterologous nucleic acid. In one aspect of the method, the epithelial 

1 0 cells are human endometrial or urinary tract epithelial cells. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the kidney epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
human renal collecting ducts or proximal tubules 

15 AAV5 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the gut, comprising delivering to the gut an AAV5 vector comprising the nucleic 
acid. In one aspect of the method, the epithelial barrier comprises human absorptive 
enterocytes or M cells. Thus, disclosed is a method of delivering a heterologous nucleic acid 

20 across human gut epithelial cells enterocytes, comprising delivering to the cells an AAV5 
vector comprising the nucleic acid. 

Disclosed is a method of transcytosing gut epithelial cells of a subject comprising 
contacting the gut epithelial cells of the subject with an AAV5 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 

25 absorptive enterocytes. 

AAV4 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the gut, comprising delivering to the gut an AAV4 vector comprising the nucleic 
acid. In one aspect of the method, the epithelial barrier comprises human absorptive 
30 enterocytes or M cells. Thus, disclosed is a method of delivering a heterologous nucleic acid 
across human gut epithelial cells enterocytes, comprising delivering to the cells an AAV4 
vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the lung, comprising delivering to the lung an AAV4 vector comprising the 
35 nucleic acid. In one aspect of the method, the epithelial barrier comprises human bronchial, 
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5 tracheal, or upper airway epithelial cells. Thus, disclosed is a method of delivering a 

heterologous nucleic acid across human airway epithelial cells, comprising delivering to the 
cells an AAV4 vector comprising the nucleic acid. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 
barrier in the CNS, comprising delivering to the CNS an AAV4 vector comprising the 

1 0 nucleic acid. In one aspect of the method, the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 
barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 
cerebral microvascular endothelial cells, comprising delivering to the cells a AAV4 vector 

comprising the nucleic acid. 

1 5 Disclosed is a method of delivering a heterologous nucleic acid across the epithelial 

barrier of blood vessels into the muscle, comprising delivering to the blood stream an AAV4 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

20 barrier in the genitourinary tract, comprising delivering to the genitourinary tract an AAV4 
vector comprising the nucleic acid. In one aspect of the method, the epithelial barrier 
comprises human endometrial or urinary epithelial cells. Thus, disclosed is a method of 
delivering a heterologous nucleic acid across human endometrial epithelial cells, comprising 
delivering to the cells an AAV4 vector comprising the nucleic acid. 

25 Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

barrier in the kidneys, comprising delivering to the kidneys an AAV4 vector comprising the 
nucleic acid. In one aspect of the method, the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. Thus, disclosed is a method of delivering a 
heterologous nucleic acid across human kidney epithelial cells, comprising delivering to the 

30 cells an AAV4 vector comprising the nucleic acid. 

Disclosed is a method of transcytosing lung epithelial cells of a subject comprising 
contacting the lung epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
bronchial, tracheal, or upper airway epithelial cells. 

35 Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 

contacting the CNS epithelial cells of the subject with an AAV4 vector comprising a 
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5 heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

Disclosed is a method of transcytosing vascular epithelial cells of a subject 
comprising contacting the vascular epithelial cells of the subject with an AAV4 vector 

10 comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 
vascular endothelial cells of the blood brain barrier. 

Disclosed is a method of transcytosing genitourinary tract epithelial cells of a subject 
comprising contacting the genitourinary epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. In one aspect of the method, the epithelial cells are 

1 5 human endometrial or urinary epithelial cells. 

Disclosed is a method of transcytosing kidney epithelial cells of a subject comprising 
contacting the kidney epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In <?ne aspect of the method, the epithelial cells are human renal 
collecting ducts or proximal tubules 

20 Disclosed is a method of transcytosing gut epithelial cells of a subject comprising 

contacting the gut epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
absorptive enterocytes. 

AAV7 

25 Disclosed is a method of delivering a heterologous nucleic acid across an epithelial 

barrier in the CNS, comprising delivering to the CNS an AAV7 vector comprising the 
nucleic acid, hi one aspect of the method, the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood brain 
barrier. Thus, disclosed is a method of delivering a heterologous nucleic acid across human 

30 cerebral microvascular endothelial cells, comprising delivering to the cells an AAV7 vector 

comprising the nucleic acid. 

Disclosed is a method of transcytosing CNS epithelial cells of a subject comprising 
contacting the CNS epithelial cells of the subject with an AAV7 vector comprising a 
heterologous nucleic acid. In one aspect of the method, the epithelial cells are human 
35 cerebral microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 
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5 Inhibition of Transcytosis to Increase Transduction 

Described herein is a method for re-directing virus that enters a cell by transcytosis 
to result in transduction of the cell by blocking exocytosis. Thus, provided is a method of 
improving the efficiency of nucleic acid delivery to epithelial cells, comprising delivering to 
the cells an inhibitor of exocytosis and an AAV vector containing the nucleic acid. Also 
1 0 provided is a method for transducing cells that have transcytosis activity but are normally 
resistant to transduction comprising administering to the cells inhibitors of exocytosis. 

In one aspect of the methods, the AAV vector is derived from AAV4, AAV5, or > 
BAAV. In a further aspect of the methods, the epithelial cell barriers are located in the 
kidney, gut, lung or vascular endothelium 
1 5 Thus, disclosed is a method of delivering a heterologous nucleic acid to human 

airway epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
kidney epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
20 an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
vascular endothelial cells, comprising delivering to the cells and an inhibitor of exocytosis 
and an AAV4 vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
25 airway epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
a BAAV vector comprising the nucleic acid. 

Further disclosed is a method of delivering a heterologous nucleic acid to human 
kidney epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and 
a BAAV vector comprising the nucleic acid. 
30 Further disclosed is a method of delivering a heterologous nucleic acid to human 

vascular endothelial cells, comprising delivering to the cells and an inhibitor of exocytosis 
and a BAAV vector comprising the nucleic acid 

Further disclosed is a method of delivering a heterologous nucleic acid to human gut 
epithelial cells, comprising delivering to the cells and an inhibitor of exocytosis and an 
35 AAV5 vector comprising the nucleic acid. 
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5 In one aspect of the disclosed methods, the inhibitors of exocytosis are chemical 

modifiers. In a further aspect of the methods, the chemical modifier is tannic acid, wherein 
the tannic acid is delivered to the basal lateral surface of the epithelial cells. 

Compositions and methods for making AAV4 vectors 

Compositions and methods for making and using AAV4 vectors have been 

10 previously described in U.S. Patent No. 6,468,524, which is hereby incorporated herein by 
reference for this teaching. 

Provided is the nucleotide sequence of the adeno-associated virus 4 (AAV4) genome 
and vectors and particles derived therefrom. Specifically, provided is a nucleic acid vector 
comprising a pair of AAV4 inverted terminal repeats (TTRs) and-a promoter between the 

1 5 inverted terminal repeats. The AAV4 ITRs are exemplified by the nucleotide sequence set 
forth in SEQ ID NO:6 and SEQ ID NO:20; however, these sequences can have minor 
modifications and still be contemplated to constitute AAV4 ITRs. The nucleic acid listed in 
SEQ ID NO:6 depicts the ITR in the "flip" orientation of the ITR. The nucleic acid fisted in 
SEQ ID NO:20 depicts the ITR in the "flop" orientation of the ITR. Minor modifications in 

20 an ITR of either orientation are those that will not interfere with the hairpin structure formed 
by the AAV4 ITR as described herein and known in the art. Furthermore, to be considered 
within the term "AAV4 ITRs" the nucleotide sequence must retain the Rep binding site 
described herein and exemplified in SEQ ID NO:6 and SEQ ID NO:20, i.e. 9 it must retain 
one or both features described herein that distinguish the AAV4 ITR from the AAV2 ITR: 

25 (1) four (rather than three as in AAV2) "GAGC" repeats and (2) in the AAV4 ITR Rep 
binding site the fourth nucleotide in the first two "GAGC" repeats is a T rather than a C. 

The promoter can be any desired promoter, selected by known considerations, such 
as the level of expression of a nucleic acid functionally linked to the promoter and the cell 
type in which the vector is to be used. Promoters can be an exogenous or an endogenous 

30 promoter. Promoters can include, for example, known strong promoters such as SV40 or the 
inducible metallothionein promoter, or an AAV promoter, such as an AAV p5 promoter. 
Additional examples of promoters include promoters derived from actin genes, 
immunoglobulin genes, cytomegalovirus (CMV), adenovirus, bovine papilloma virus, 
adenoviral promoters, such as the adenoviral major late promoter, an inducible heat shock 

35 promoter, respiratory syncytial virus, Rous sarcomas virus (RSV), etc. Specifically, the 
promoter can be AAV2 p5 promoter or AAV4 p5 promoter. More specifically, the AAV4 

20 
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5 p5 promoter can be about nucleotides 130 to 291 of SEQ ID NO: 1. Additionally, the p5 
promoter may be enhanced by nucleotides 1-130. Furthermore, smaller fragments of p5 
promoter that retain promoter activity can readily be determined by standard procedures 
including, for example, constructing a series of deletions in the p5 promoter, linking the 
deletion to a reporter gene, and determining whether the reporter gene is expressed, i.e., 

10 transcribed and/or translated. 

The present invention also contemplates any unique fragment of these AAV4 nucleic 
acids, including the AAV4 nucleic acids set forth in SEQ ID NOs: 1, 3, 5, 6, 7, 12-15, 17 
and 19. Fragments can be, for example, at least about 30, 40, 50, 75, 100, 200 or 500 
nucleotides in length. The nucleic acid can be single or double stranded, depending upon the 

1 5 purpose for which it is intended. 

The present invention further provides an AAV4 Capsid polypeptide or a unique 
fragment thereof. AAV4 capsid polypeptide is encoded by ORF 2 of AAV4. Specifically, 
provided is an AAV4 Capsid protein comprising the amino acid sequence encoded by 
nucleotides 2260-4464 of the nucleotide sequence set forth in SEQ TD NO:l, or a unique 

20 fragment of such protein. The present invention also provides an AAV4 Capsid protein 
consisting essentially of the amino acid sequence encoded by nucleotides 2260-4464 of the 
nucleotide sequence set forth in SEQ ID NO: 1, or a unique fragment of such protein. The 
present invention further provides the individual AAV4 coat proteins, VP 1, VP2 and VP3. 
Thus, provided is an isolated polypeptide having the amino acid sequence set forth in SEQ 

25 ID NO:4 (VP1). The present invention additionally provides an isolated polypeptide having 
the amino acid sequence set forth in SEQ ID NO:16 (VP2). The present invention also 
provides an isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 1 8 
(VP3). By "unique fragment thereof is meant any smaller polypeptide fragment encoded by 
any AAV4 capsid gene that is of sufficient length to be unique to the AAV4 Capsid protein. 

30 Substitutions and modifications of the amino acid sequence can be made as described above 
and, further, can include protein processing modifications, such as glycosylation, to the 
polypeptide. However, an AAV4 Capsid polypeptide including all three coat proteins will 
have at least about 63% overall homology to the polypeptide encoded by nucleotides 2260- 
4464 of the sequence set forth in SEQ ID NO: 1 . The protein can have about 65%, about 

35 70%, about 75%, about 80%, about 85%, about 90%, about 95% or even 100% homology to 
the amino acid sequence encoded by the nucleotides 4467of the sequence set forth in SEQ 
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5 ID NO: 1 . An AAV4 VP2 polypeptide can have at least about 58%, about 60%, about 70%, 
about 80%, about 90% about 95% or about 100% homology to the amino acid sequence set 
forth in SEQ ID NO: 16. An AAV4 VP3 polypeptide can have at least about 60%, about 
70%, about 80%, about 90% about 95% or about 100% homology to the amino acid 
sequence set forth in SEQ ID NO: 18. 

10 The herein described AAV4 nucleic acid vector can be encapsidated in an AAV 

particle. In particular, it can be encapsidated in an AAV1 particle, an AAV2 particle, an 
AAV3 particle, an AAV4 particle, or an AAV5 particle by standard methods using the 
appropriate capsid proteins in the encapsidation process, as long as the nucleic acid vector 
fits within the size limitation of the particle utilized. The encapsidation process itself is 

1 5 standard in the art. 

An AAV4 particle is a viral particle comprising an AAV4 capsid protein. An AAV4 
capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide can overall have at 
least about 63% homology to the polypeptide having the amino acid sequence encoded by 
nucleotides 2260-4464 set forth in SEQ ID NO: 1 (AAV4 capsid protein). The capsid protein 

20 can have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by nucleotides 2260-4464 set forth in SEQ 
ID NO: 1 . The particle can be a particle comprising both AAV4 and AAV2 capsid protein, 
Le. 9 a chimeric protein. Variations in the amino acid sequence of the AAV4 capsid protein 

25 are contemplated herein, as long as the resulting viral particle comprising the AAV4 capsid 
remains antigenically or immunologically distinct from AAV2, as can be routinely 
determined by standard methods. Specifically, for example, ELISA and Western blots can 
be used to determine whether a viral particle is antigenically or immunologically distinct 
from AAV2. Furthermore, the AAV4 viral particle preferably retains tissue tropism 

30 distinction from AAV2, such as that exemplified in the examples herein, though an AAV4 
chimeric particle comprising at least one AAV4 coat protein may have a different tissue 
tropism from that of an AAV4 particle consisting only of AAV4 coat proteins. 

An AAV4 particle is a viral particle comprising an AAV4 capsid protein. An AAV4 
capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide can overall have at 

35 least about 63% homology to the polypeptide having the amino acid sequence encoded by 
nucleotides 2260-4467 set forth in SEQ ID NO:l (AAV4 capsid protein). The capsid protein 
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5 can have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by nucleotides 2260-4467 set forth in SEQ 
ID NO: 1 . The particle can comprise only VP 1 and VP3 and still stably transduce cells. 
The particle can be a particle comprising both AAV4 and AAV2 capsid protein, i.e., a 

1 0 chimeric protein. Variations in the amino acid sequence of the AAV4 capsid protein are 
contemplated herein, as long as the resulting viral particle comprising the AAV4 capsid 
. remains antigenically or immunologically distinct from AAV2, as can be routinely 

determined by standard methods. Specifically, for example, ELISA and Western blots can 
be used to determine whether a viral particle is antigenically or immunologically distinct 

1 5 from AAV2. Furthermore, the AAV4 viral particle preferably retains tissue tropism 

distinction from AAV2, such as that exemplified in the examples herein, though an AAV4 
chimeric particle comprising at least one AAV4 coat protein may have a different tissue 
tropism from that of an AAV4 particle consisting only of AAV4 coat proteins. 

The invention further provides an AAV4 particle containing, i. e. , encapsidating, a 

20 vector comprising a pair of AAV2 inverted terminal repeats. The nucleotide sequence of 
AAV2 ITRs is known in the art. Furthermore, the particle can be a particle comprising both 
AAV4 and AAV2 capsid protein, i.e., a chimeric protein. The vector encapsidated in the 
particle can further comprise an exogenous nucleic acid inserted between the inverted 
terminal repeats. 

25 The present invention further provides an isolated nucleic acid comprising the 

nucleotide sequence set forth in SEQ ID NO: 1 (AAV4 genome). This nucleic acid, or 
portions thereof, can be inserted into other vectors, such as plasmids, yeast artificial 
chromosomes, or other viral vectors, if desired, by standard cloning methods. The present 
invention also provides an isolated nucleic acid consisting essentially of the nucleotide 

30 sequence set forth in SEQ ID NO: 1 . The nucleotides of SEQ ID NO: 1 can have minor 
modifications and still be contemplated by the present invention. For example, 
modifications that do not alter the amino acid encoded by any given codon (such as by 
modification of the third, "wobble," position in a codon) can readily be made, and such 
alterations are known in the art. Furthermore, modifications that cause a resulting neutral 

35 amino acid substitution of a similar amino acid can be made in a coding region of the 

genome. Additionally, modifications as described herein for the AAV4 components, such as 
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5 the ITRs, the p5 promoter, etc. are contemplated in this invention. 

The present invention additionally pro vides an isolated nucleic acid that selectively 
hybridizes with an isolated nucleic acid consisting essentially of the nucleotide sequence set 
forth in SEQ ID NO:l (AAV4 genome). The present invention further provides an isolated 
nucleic acid that selectively hybridizes with an isolated nucleic acid comprising the 

10 nucleotide sequence set forth in SEQ ID NO:l (AAV4 genome). By "selectively hybridizes" 
as used in the claims is meant a nucleic acid that specifically hybridizes to the particular 
target nucleic acid under sufficient stringency conditions to selectively hybridize to the 
target nucleic acid without significant background hybridization to a nucleic acid encoding 
an unrelated protein, and particularly, without detectably hybridizing to AAV2. Thus, a 

15 nucleic acid that selectively hybridizes with a nucleic acid of the present invention will not 
selectively hybridize under stringent conditions with a nucleic acid encoding a different 
protein, and vice versa. Therefore, nucleic acids for use, for example, as primers and probes 
to detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 
that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 

20 probes for further hybridization or for amplification methods (e.g. , polymerase chain 

reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 
can be designed that selectively hybridizes with both AAV4 and a gene of interest carried 
within the AAV4 vector (i. e. , a chimeric nucleic acid). 

The present invention further provides an isolated nucleic acid encoding an adeno- 

25 associated virus 4 Rep protein. The AAV4 Rep proteins are encoded by open reading frame 
(ORF) 1 of the AAV4 genome. The AAV4 Rep genes are exemplified by the nucleic acid 
set forth in SEQ ID NO:3 (AAV4 ORF1), and include a nucleic acid consisting essentially 
of the nucleotide sequence set forth in SEQ ID NO:3 and a nucleic acid comprising the 
nucleotide sequence set forth in SEQ ID NO:3. The present invention also includes a nucleic 

30 acid encoding the amino acid sequence set forth in SEQ ID NO: 2 (polypeptide encoded by 
AAV4 ORF1). However, the present invention includes that the Rep genes nucleic acid can 
include any one, two, three, or four of the four Rep proteins, in any order, in such a nucleic 
acid. Furthermore, minor modifications are contemplated in the nucleic acid, such as silent 
mutations in the coding sequences, mutations mat make neutral or conservative changes in 

35 the encoded amino acid sequence, and mutations in regulatory regions that do not disrupt 
the expression of the gene. Examples of other minor modifications are known in the art. 
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5 Further modifications can be made in the nucleic acid, such as to disrupt or alto: expression 
of one or more of the Rep proteins in order to, for example, determine the effect of such a 
disruption; such as to mutate one or more of the Rep proteins to determine the resulting . 
effect, etc. However, in general, a modified nucleic acid encoding all four Rep proteins will 
have at least about 90%, about 93%, about 95%, about 98% or 100% homology to the 

1 0 sequence set forth in SEQ ID NO:3, and the Rep polypeptide encoded therein will have 
overall about 93%, about 95%, about 98%, about 99% or 100% homology with the amino 
acid sequence set forth in SEQ ID NO:2. 

The present invention also provides an isolated nucleic acid that selectively 
hybridizes with a nucleic acid consisting essentially of the nucleotide sequence set forth in 

1 5 SEQ ID NO:3 and an isolated nucleic acid that selectively hybridizes with a nucleic acid 
comprising the nucleotide sequence set forth in SEQ ID NO:3. "Selectively hybridizing" is 

defined elsewhere herein. 

The present invention also provides each individual AAV4 Rep protein and the 
nucleic acid encoding each. Thus provided is the nucleic acid encoding a Rep 40 protein, 

20 and in particular an isolated nucleic acid comprising the nucleotide sequence set forth in 

SEQ ID NO: 12, an isolated nucleic acid consisting essentially of the nucleotide sequence set 
forth in SEQ ID NO: 12, and a nucleic acid encoding the adeno-associated virus 4 Rep 
protein having the amino acid sequence set forth in SEQ ID NO:8. The present invention- 
also provides the nucleic acid encoding a Rep 52 protein, and in particular an isolated 

25 nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 13, an isolated 
nucleic acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO:13, 
and a nucleic acid encoding the adeno-associated virus 4 Rep protein having the amino acid 
sequence set forth in SEQ ID NO:9. The present invention further provides the nucleic acid 
encoding a Rep 68 protein, and in particular an isolated nucleic acid comprising the 

30 nucleotide sequence set forth in SEQ ID NO: 14, an isolated nucleic acid consisting 
essentially of the nucleotide sequence set forth in SEQ ID NO:14, and a nucleic acid 
encoding the adeno-associated virus 4 Rep protein having the amino acid sequence set forth 
in SEQ ID NO: 10. And, further, provided is the nucleic acid encoding a Rep 78 protein, and 
in particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 

35 NO:15, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO: 15, and a nucleic acid encoding the adeno-associated virus 4 Rep protein 
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5 having the amino acid sequence set forth in SEQ ID NO: 1 1 . As described elsewhere herein, 
these nucleic acids can have minor modifications, including silent nucleotide substitutions, 
mutations causing neutral amino acid substitutions in the encoded proteins, and mutations in 
control regions that do not or minimally affect the encoded amino acid sequence. 

The present invention further provides a nucleic acid encoding the entire AAV4 

1 0 Capsid polypeptide. Specifically, provided is a nucleic acid having the nucleotide sequence 
set for the nucleotides 2260-4467 of SEQ ID NO: 1 . Furthermore, provided is a nucleic acid 
encoding each of the three AAV4 coat proteins, VP1, VP2, and VP3. Thus, provided is a 
nucleic acid encoding AAV4 VP 1 , a nucleic acid encoding AAV4 VP2, and a nucleic acid 
encoding AAV4 VP3. Thus, provided is a nucleic acid encoding the amino acid sequence 

15 set forth in SEQ ID NO:4 (VP1); a nucleic acid encoding the amino acid sequence set forth 
in SEQ ID NO: 16 (VP2), and a nucleic acid encoding the amino acid sequence set forth in 
SEQ ID NO:18 (VP3). The present invention also specifically provides a nucleic acid 
comprising SEQ ID NO:5 (VP1 gene); a nucleic acid comprising SEQ ID NO:17 (VP2 
gene); and a nucleic acid comprising SEQ ID NO:19 (VP3 gene). The present invention also 

20 specifically provides a nucleic acid consisting essentially of SEQ ID NO:5 (VP1 gene), a 
nucleic acid consisting essentially of SEQ ID NO: 1 7 (VP2 gene), and a nucleic acid 
consisting essentially of SEQ ID NO: 19 (VP3 gene). Furthermore, a nucleic acid encoding 
an AAV4 capsid protein VP1 is set forth as nucleotides 2260-4467 of SEQ ID NO:l; a 
nucleic acid encoding an AAV4 capsid protein VP2 is set forth as nucleotides 2668-4467 of 

25 SEQ ID NO: 1 ; and a nucleic acid encoding an AAV4 capsid protein VP3 is set forth as 
nucleotides 2848-4467 of SEQ ID NO:l. Minor modifications in the nucleotide sequences 
encoding the capsid, or coat, proteins are contemplated, as described above for other AAV4 
nucleic acids. 

Provided is an isolated AAV4 Rep protein. AAV4 Rep polypeptide is encoded by 
30 ORF1 of AAV4. Specifically, provided is an AAV4 Rep polypeptide comprising the amino 
acid sequence set forth in SEQ ID NO:2, or a unique fragment thereof. The present 
invention also provides an AAV4 Rep polypeptide consisting essentially of the amino acid 
sequence set forth in SEQ ID NO:2, or a unique fragment thereof. Additionally, nucleotides 
291-2306 of the AAV4 genome, which genome is set forth in SEQ ID NO:l, encode the 
35 AAV4 Rep polypeptide. The present invention also provides each AAV4 Rep protein. Thus 
provided is AAV4 Rep 40, or a unique fragment thereof. The present invention particularly 

■ 
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5 provides Rep 40 having the amino acid sequence set forth in SEQ ID NO:8. Provided is 
AAV4 Rep 52, or a unique fragment thereof. The present invention particularly provides 
Rep 52 having the amino acid sequence set forth in SEQ ID NO:9. Provided is AAV4 Rep 
68, or a unique fragment thereof. The present invention particularly provides Rep 68 having, 
the amino acid sequence set forth in SEQ ID NO: 10. Provided is AAV4 Rep 78, or a unique 

1 0 fragment thereof. The present invention particularly provides Rep 78 having the amino acid 
sequence set forth in SEQ ID NO: 1 1 . By "unique fragment thereof is meant any smaller 
polypeptide fragment encoded by AAV rep gene that is of sufficient length to be unique to 
the Rep polypeptide. Substitutions and modifications of the amino acid sequence can be 
made as described above and, further, can include protein processing modifications, such as 

1 5 glycosylation, to the polypeptide. However, a polypeptide including all four Rep proteins 
will encode a polypeptide having at least about 91% overall homology to the sequence set 
forth in SEQ ID NO:2, and it can have about 93%, about 95%, about 98%, about 99% or 
1 00% homology with the amino acid sequence set forth in SEQ ID NO:2. 

The present invention further provides an AAV4 Capsid polypeptide or a unique 

20 fragment thereof. AAV4 capsid polypeptide is encoded by ORF 2 of AAV4. Specifically, 
provided is an AAV4 Capsid protein comprising the amino acid sequence encoded by 
nucleotides 2260-4467.of the nucleotide sequence set forth in SEQ ID NO: 1, or a unique 
fragment of such protein. The present invention also provides an AAV4 Capsid protein 
consisting essentially of the amino acid sequence encoded by nucleotides 2260-4467 of the 

25 nucleotide sequence set forth in SEQ ID NO:l, or a unique fragment of such protein. The 
present invention further provides the individual AAV4 coat proteins, VP1, VP2 and VP 3. 
Thus, provided is an isolated polypeptide having the amino acid sequence set forth in SEQ 
ID NO:4 (VP1). The present invention additionally provides an isolated polypeptide having 
the amino acid sequence set forth in SEQ ID NO: 16 (VP2). The present invention also 

30 provides an isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 1 8 
(VP3). By "unique fragment thereof is meant any smaller polypeptide fragment encoded by 
any AAV4 capsid gene mat is of sufficient length to be unique to the AAV4 Capsid protein. 
Substitutions and modifications of the amino acid sequence can be made as described above 
and, further, can include protein processing modifications, such as glycosylation, to the 

35 polypeptide. However, an AAV4 Capsid polypeptide including all three coat proteins will 
have at least about 63% overall homology to the polypeptide encoded by nucleotides 2260- 
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5 4467 of the sequence set forth in SEQ ID NO: 1. The protein can have about 65%, about 
70%, about 75%, about 80%, about 85%, about 90%, about 95% or even 100% homology to 
the amino acid sequence encoded by the nucleotides 2260-4467 of the sequence set forth in 
SEQ ID NO:4. An AAV4 VP2 polypeptide can have at least about 58%, about 60%, about 
70%, about 80%, about 90% about 95% or about 100% homology to the amino acid 

10 sequence set forth in SEQ ID NO:16. An AAV4 VP3 polypeptide can have at least about 
60%, about 70%, about 80%, about 90% about 95% or about 100% homology to the amino 
acid sequence set forth in SEQ ID NO: 18. 

The AAV inverted terminal repeats in the vector for the herein described delivery 
methods can be AAV4 inverted terminal repeats. Specifically, they can comprise the nucleic 

1 5 acid whose nucleotide sequence is set forth in SEQ ID NO:6 or the nucleic acid whose 

nucleotide sequence is set forth in SEQ ID NO:20, or any fragment thereof demonstrated to 
have ITR functioning. The ITRs can also consist essentially of the nucleic acid whose 
nucleotide sequence is set forth in SEQ ID NO:6 or the nucleic acid whose nucleotide 
sequence is set forth in SEQ ID NO:20. Furthermore, the AAV inverted terminal repeats in 

20 the vector for the herein described nucleic acid delivery methods can also comprise AAV2 
inverted terminal repeats. Additionally, the AAV inverted terminal repeats in the vector for 
this delivery method can also consist essentially of AAV2 inverted terminal repeats. 

Compositions and methods for making AAV5 vectors 

Compositions and methods for making and using AAV5 vectors have been 
25 previously described in U.S. Patent Application No. 09/717,789, filed November 21, 2000, 
and in U.S. Patent No. 6,855,3 14, which are hereby incorporated herein by reference for this 
teaching. 

The present application provides a recombinant adeno-associated virus 5 (AAV5). 
This virus has one or more of the characteristics described below. The compositions of the 
30 present invention do not include wild-type AAV5. The methods of the present invention can 
use either wild-type AAV5 or recombinant AAV5-based delivery. 

Provided are novel AAV5 particles, recombinant AAV5 vectors, recombinant AAV5 
virions and novel AAV5 nucleic acids and polypeptides. An AAV5 particle is a viral 
particle comprising an AAV5 capsid protein. A recombinant AAV5 vector is a nucleic acid 
35 construct that comprises at least one unique nucleic acid of AAV5. A recombinant AAV5 
virion is a particle containing a recombinant AAV5 vector, wherin the particle can be either 
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5 an AAV5 particle as described herein or a non-AAV5 particle. Alternatively, the 

recombinant: AAV5 virion is an AAV5 particle containing a recombinant vector, wherein 
the vector can be either an AAV5 vector as described herein or a non-AAV5 vector. These 
vectors, particles, virions, nucleic acids and polypeptides are described below. 

Provided is the nucleotide sequence of the AAV5 genome and vectors and particles 

1 0 derived therefrom. Specifically, provided is a nucleic acid vector comprising a pair of 
AAV5 inverted terminal repeats (ITRs) and a promoter between the inverted terminal 
repeats. While the rep proteins of AAV2 and AAV5 will bind to either a type 2 ITR or a 
type 5 ITR, efficient genome replication only occurs when type 2 Rep replicates a type 2 
ITR and a type 5 Rep replicates a type 5 ITR. This specificity is the result of a difference in 

15 DNA cleavage specificity of the two Reps which is necessary for replication. AAV5 Rep 
cleaves at CGGT A GTGA (SEQ ID NO: 43) and AAV2 Rep cleaves at CGGT A TGAG (SEQ 
ID NO: 44) (Chiorini et al., 1999. J. Virol. 73 (5) 4293-4298). Mapping of the AAV5 ITR 
terminal resolution site (TRS) identified this distinct cleavage site, CGGT A GTGA, which is 
absent from the ITRs of other AAV serotypes. Therefore, the minimum sequence necessary 

20 to distinguish AAV5 from AAV2 is the TRS site where Rep cleaves in order to replicate the 
virus. Examples of the type 5 ITRs are shown in SEQ ID NO: 41 and SEQ ID NO: 42, 
AAV5 ITR "flip" and AAV5 "flop", respectively. Minor modifications in an ITR of either 
orientation are contemplated and are those that will not interfere with the hairpin structure 
formed by the AAV5 ITR as described herein. Furthermore, to be considered within the 

25 tenn "AAV5 ITR" the nucleotide sequence must retain one or more features described 
herein that distinguish the AAV5 ITR from the ITRs of other serotypes, e.g. it must retain 
the Rep binding site described herein. 

The D- region of the AAV5 ITR (SEQ ID NO: 45), a single stranded region of the 
ITR, inboard of the TRS site, has been shown to bind a factor which depending on its 

30 phosphorylation state correlates with the conversion of the AAV from a single stranded 
genome to a transcriptionally active form that allows for expression of the viral DNA. This 
region is conserved between AAV2, 3, 4,and 6 but is divergent in AAV5. The D+ region is 
the reverse complement of the D- region. 

The promoter can be any desired promoter, selected by known considerations, such 

35 as the level of expression of a nucleic acid functionally linked to the promoter and the cell 
type in which the vector is to be used. That is, the promoter can be tissue/cell-specific. 
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5 Promoters can be prokaryotic, eukaryotic, fungal, nuclear, mitochondrial, viral or plant 
promoters. Promoters can be exogenous or endogenous to the cell type being transduced by 
the vector. Promoters can include, for example, bacterial promoters, known strong 
promoters such as S V40 or the inducible metallothionein promoter, or an AAV promoter, 
such as an AAV p5 promoter. Additionally, chimeric regulatory promoters for targeted gene 

10 expression can be utilized. Examples of these regulatory systems, which are known in the 
art, include the tetracycline based regulatory system which utilizes the tet transactivator 
protein (tTA), a chimeric protein containing the VP 16 activation domain fused to the tet 
repressor of Escherichia coli, the IPTG based regulatory system, the CID based regulatory 
system, and the Ecdysone based regulatory system. Other promoters include promoters 

15 derived from actin genes, immunoglobulin genes, cytomegalovirus (CMV), adenovirus, 
bovine papilloma virus, adenoviral promoters, such as the adenoviral major late promoter, 
an inducible heat shock promoter, respiratory syncytial virus, Rous sarcomas virus (RS V), 
etc., specifically, the promoter can be AAV2 p5 promoter or AAV5 p5 promoter. More 
specifically, the AAV5 p5 promoter can be about same location in SEQ ED NO: 23 as the 

20 AAV2 p5 promoter, in the corresponding AAV2 published sequence. An example of an 
AAV5 p5 promoter is nucleotides 220-338 of SEQ ID NO: 23. Additionally, the p5 
promoter may be enhanced by nucleotides 1-130 of SEQ ID NO: 23. Furthermore, smaller 
fragments of p5 promoter that retain promoter activity can readily be determined by standard 
procedures including, for example, constructing a series of deletions in the p5 promoter, 

25 linking the deletion to a reporter gene, and determining whether the reporter gene is 

expressed, i.e., transcribed and/or translated. The promoter can be the promoter of any of the 
AAV serotypes, and can be the pl9 promoter (SEQ ID NO: 38) or the p40 promoter set 
forth in the sequence listing as SEQ ID NO: 39. 

It should be recognized that any errors in any of the nucleotide sequences disclosed 

30 herein can be corrected, for example, by using the hybridization procedure described below 
with various probes derived from the described sequences such that the coding sequence can 
be reisolated and resequenced. Rapid screening for point mutations can also be achieved 
with the use of polymerase chain reaction single strand conformation polymorphism (PCR 
SSCP). The corresponding amino acid sequence can then be corrected accordingly. 

35 The AAV5-derived vector can include any normally occurring AAV5 sequences in 

addition to an ITR and promoter. Examples of vector constructs are provided below. 
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5 The present vector or AAV5 particle or recombinant AAV5 virion can utilize any 

unique fragment of the present AAV5 nucleic acids, including the AAV5 nucleic acids set 
forth in SBQ ID NOS: 23 and 29-33, 35, 37, 38, 39 and 40. To be unique, the fragment must 
be of sufficient size to distinguish it from other known sequences, most readily determined 
by comparing any nucleic acid fragment to the nucleotide sequences of nucleic acids in 
10 computer databases, such as GenBank. Such comparative searches are standard in the art. 
Typically, a unique fragment useful as a primer or probe will be at least about 8 or 10, 
preferable at least 20 or 25 nucleotides in length, depending upon the specific nucleotide 
content of the sequence. Additionally, fragments can be, for example, at least about 30, 40, 
50, 75, 1 00, 200 or 500 nucleotides in length and can encode polypeptides or be probes. The 
1 5 nucleic acid can be single or double stranded, depending upon the purpose for which it is 
intended. Where desired, the nucleic acid can be RNA. 

The present invention further provides an isolated AAV5 capsid protein to contain 
the vector. In particular, provided is not only a polypeptide comprising all three AAV5 coat 
proteins, i.e., VP1, VP2 and VP3, but also a polypeptide comprising each AAV5 coat 
20 protein individually, SEQ ID NOS: 26, 27, and 28, respectively. Thus an AAV5 particle 

comprising an AAV5 capsid protein comprises at least one AAV5 coat protein VP1, VP2 or 
VP3. An AAV5 particle comprising an AAV5 capsid protein can be utilized to deliver a 
nucleic acid vector to a cell, tissue or subject. For example, the herein described AAV5 
vectors can be encapsidated in an AAV5 capsid-derived particle and utilized in a gene 
25 delivery method. Furthermore, other viral nucleic acids can be encapsidated in the AAV5 
particle and utilized in such delivery methods. For example, an AAV1, 2,3,4,or 6 vector 
(e.g. AAVl,2,3,4,or 6 ITR and nucleic acid of interest )can be encapsidated in an AAV5 
particle and administered. Furthermore, an AAV5 chimeric capsid incorporating both AAV2 
capsid and AAV5 capsid sequences can be generated, by standard cloning methods, 
30 selecting regions from the known sequences of each protein as desired. For example, 
particularly antigenic regions of the AAV2 capsid protein can be replaced with the 
corresponding region of the AAV5 capsid protein. In addition to chimeric capsids 
incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3, 4, or 6 and 
AAV5 capsid sequences can be generated, by standard cloning methods, selecting regions 
35 from the known sequences of each protein as desired. The particle can also comprise only 
VP 1 and VP3 capsid proteins. 
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5 The capsids can also be modified to alter their specific tropism by genetically 

altering the capsid to encode a specific ligand to a cell surface receptor. Alternatively, the 
capsid can be chemically modified by conjugating a ligand to a cell surface receptor. By 
genetically or chemically altering the capsids, the tropism can be modified to direct AAV5 
to a particular cell or population of cells. The capsids can also be altered immunologically 

10 by conjugating the capsid to an antibody that recognizes a specific protein on the target cell 

or population of cells. 

The capsids can also be assembled into empty particles by expression in mammalian, 
bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
VP3 and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 

1 5 AAV5 particle comprising an AAV5 capsid protein. 

The herein described recombinant AAV5 nucleic acid derived vector can be 
encapsidated in an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle or an AAV6 
particle, a portion of any of these capsids, or a chimeric capsid particle as described above, 

20 by standard methods using the appropriate capsid proteins in the encapsidation process, as 
long as the nucleic acid vector fits within the size limitation of the particle utilized. The 
encapsidation process itself is standard in the art. The AAV5 replication machinery, i.e. the 
rep initiator proteins and other functions required for replication, can be utilized to produce 
the AAV5 genome that can be packaged in an AAV1, 2, 3, 4, 5 or 6 capsid. 

25 The recombinant AAV5 virion containing a vector can also be produced by 

recombinant methods utilizing multiple plasmids. In one example, the AAV5 rep nucleic 
acid would be cloned into one plasmid, the AAV5 ITR nucleic acid would be cloned into 
another plasmid and the AAV1, 2, 3, 4, 5 or 6 capsid nucleic acid would be cloned on 
another plasmid. These plasmids would then be introduced into cells. The cells that were 

30 efficiently transduced by all three plasmids, would exhibit specific integration as well as the 
ability to produce recombinant AAV5 virion. Additionally, two plasmids could be used 
where the AAV5 rep nucleic acid would be cloned into one plasmid and the AAV5 ITR and 
AAV5 capsid would be cloned into another plasmid. These plasmids would then be 
introduced into cells. The cells that were efficiently transduced by both plasmids, would 

35 exhibit specific integration as well as the ability to produce recombinant AAV5 virion. 
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5 An AAV5 capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide 

can have greater than 56% overall homology to the polypeptide having the amino acid 
sequence encoded by nucleotides in SEQ ID NOS: 29/30, 31, as shown in figures 4 and 5. 
The capsid protein can have about 70% homology, about 75% homology, 80% homology, 
85% homology, 90% homology, 95% homology, 98% homology, 99% homology, or even 

10 100% homology to the protein having the amino acid sequence encoded by the nucleotides 
set forth in SEQ ID NOS: 29, 30, or 3 L The percent homology used to identify proteins 
herein, can be based on a nucleotide-by-nucleotide comparison or more preferable is based 
on a computerized algorithm as described herein. Variations in the amino acid sequence of 
the AAV5 capsid protein are contemplated herein, as long as the resulting particle 

1 5 comprising an AAV5 capsid protein remains antigenically or immunologically distinct from 
AAV1 , AAV2, AAV3, AAV4 or AAV6 capsid, as can be routinely detennined by standard 
methods. Specifically, for example, ELISA and Western blots can be used to determine 
whether a viral particle is antigenically or immunologically distinct from AAV2 or the other 
serotypes. Furthermore, the AAV5 particle preferably retains tissue tropism distinction 

20 from AAV2, such as that exemplified in the examples herein. An AAV5 chimeric particle 
comprising at least one AAV5 coat protein may have a different tissue tropism from that of 
an AAV5 particle consisting only of AAV5 coat proteins, but is still distinct from the 
tropism of an AAV2 particle, in that it will infect some cells not infected by AAV2 or an 
AAV2 particle. 

25 The invention further provides a recombinant AAV5 virion, comprising an AAV5 

particle containing, i.e., encapsidating, a vector comprising a pair of AAV5 inverted 
terminal repeats. The recombinant vector can further comprise an AAV5 Rep-encoding 
nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
nucleic acid inserted between the inverted terminal repeats. AAV5 Rep confers targeted 

30 integration and efficient replication, thus production of recombinant AAV5, comprising 
AAV5 Rep, yields more particles than production of recombinant AAV2. Since AAV5 is 
more efficient at replicating and packaging its genome, the exogenous nucleic acid inserted, 
or in the AAV5 capsids of the present invention, between the. inverted terminal repeats can 
be packaged in the AAV1, 2, 3, 4, or 6 capsids to achieve the specific tissue tropism 

35 conferred by the capsid proteins. 
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* * 

5 The invention further contemplates chimeric recombinant ITRs that contains a rep 

binding site and a TRS site recognized by that Rep protein. By "Rep protein" is meant all 
four of the Rep proteins, Rep 40, Rep 78, Rep 52, Rep 68. Alternatively, "Rep protein" 
could be one or more of the Rep proteins described herein. One example of a chimeric ITR 
would consist of an AAV5 D region (SEQ ID NO: 45), an AAV5 TRS site (SEQ ID NO: 

10 43), an AAV2 hairpin and an AAV2 binding site. Another example would be an AAV5 D 
region, an AAV5 TRS site, an AAV3 hairpin and an AAV3 binding site. In these chimeric 
ITRs, the D region can be from AAV1 , 2, 3, 4, 5 or 6. The hairpin can be derived from AAV 
1,2 3, 4, 5, 6. The binding site can be derived from any of AAV1, 2, 3, 4, 5 or 6. Preferably, 
the D region and the TRS are from the same serotype. 

15 The chimeric ITRs can be combined with AAV5 Rep protein and any of the AAV 

serotype capsids to obtain recombinant virion. For example, recombinant virion can be 
produced by an AAV5 D region, an AAV5 TRS site, an AAV2 hairpin, an AAV2 binding 
site, AAV5 Rep protein and AAV1 capsid. This recombinant virion would possess the 
cellular tropism confeired by the AAV1 capsid protein and would possess the efficient 

20 replication conferred by the AAV5 Rep. 

Other examples of the ITR, Rep protein and Capsids that will produce recombinant 

virion are provided in the list below: 
5ITR + 5Rep + 5Cap=virion 
5ITR + 5Rep 4- lCap=virion 

25 5ITR + 5Rep + 2Cap=virion 

- 

5ITR + 5Rep + 3Cap==virion 
5ITR + 5Rep + 4Cap=virion 
5ITR + 5Rep + 6Cap=virion 
1ITR + IRep + 5Cap=virion 
30 2ITR + 2Rep + 5Cap=*virion 

3ITR + 3Rep + 5Cap=virion 
4ITR + 4Rep + 5Cap=virion 
6TTR + 6Rep + 5Cap=virion 

In any of the constructs described herein, inclusion of a promoter is preferred. As 
35 used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 

AAV5 VP1, AAV5 VP2, AAV5 VP3, combinations thereof, functional fragments of any of 
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5 VP 1 , VP2 or VP3, or chimeric capsids as described herein. The URs of the constructs 
described herein, can be chimeric recombinant URs as described elsewhere in the 
application. 

Conjugates of recombinant or wild-type AAV5 virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified AA.V5 can be 

1 0 used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 
to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 
molecule. Also contemplated are virosomes that contain AAV5 structural proteins (AAV5 
capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

1 5 Also provided by this invention are conjugates that utilize theAAV5 capsid or a 

unique region of the AAV5 capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 
to introduce DNA into cells. For example, the type 5 VP3 protein or fragment thereof, can 
be conjugated to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected 
using the targeting ability of the VP3 capsid protein to achieve the desired tissue tropism, 

20 specific to AAV5. Type 5 VP1 and VP2 proteins can also be utilized to introduce DNA or 
other molecules into cells. By further incorporating the Rep protein and the AAV TRS into 
the DNA-containing conjugate, cells can be transduced and targeted integration can be 
achieved. For example, if AAV5 specific targeted integration is desired, a conjugate 
composed of the AAV5 VP3 capsid, AAV5 rep or a fragment of AAV5 rep, AAV5 TRS, 

25 the rep binding site, the heterologous DNA of interest, and a lipid, can be utilized to achieve 
AAV5 specific tropism and AAV5 specific targeted integration in the genome. 

Further provided by this invention are chimeric viruses where AAV5 can be 
combined with herpes virus, herpes virus amplicons, baculovirus or other viruses to achieve 
a desired tropism associated with another virus. For example, the AAV5 ITRs could be 

30 inserted in the herpes virus and cells could be infected. Post-infection, the ITRs of AAV5 
could be acted on by AAV5 rep provided in the system or in a separate vehicle to rescue 
AAV5 from the genome. Therefore, the cellular tropism of the herpes simplex virus can be 
combined with AAV5 rep mediated targeted integration. Other viruses that could be utilized 
to construct chimeric viruses include, lentivirus, retrovirus, pseudotyped retroviral vectors, 

3 5 and adenoviral vectors . 
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5 The present invention further provides isolated nucleic acids of AAV5. For example, 

provided is an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO: 23 (AAV5 genome). This nucleic acid, or portions thereof, can be inserted into vectors, 
such as plasmids, yeast artificial chromosomes, or other viral vector (particle), if desired, by 
standard cloning methods. The present invention also provides an isolated nucleic acid 

1 0 consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 23. The 

nucleotides of SEQ ID NO: 23 can have minor modifications and still be contemplated by 
the present invention. For example, modifications that do not alter the amino acid encoded 
by any given codon (such as by modification of the third, "wobble" position in a codon) can 
readily be made, and such alterations are known in the art. Furthermore, modifications that 

1 5 cause a resulting neutral (conserved) amino acid substitution of a similar amino acid can be 
made in a coding region of the genome. Additionally, modifications as described herein for 
the AAV5 components, such as the ITRs, the p5 promoter, etc. are contemplated in this 
invention. Furthermore, modifications to regions of SEQ ID NO: 23 other than in the ITR, 
TRS Rep binding site and haiipin are likely to be tolerated without serious impact on the 

20 function of the nucleic acid as a recombinant vector. 

The present invention additionally provides an isolated nucleic acid that selectively 
hybridizes with any nucleic acid disclosed herein, including the entire AAV5 genome and 
any unique fragment thereof, including the Rep and capsid encoding sequences (e.g. SEQ ID 
NOS: 23, 29, 30, 31, 32, 33, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, and 45). Specifically, the 

25 nucleic acid can selectively or specifically hybridize to an isolated nucleic acid consisting of 
the nucleotide sequence set forth in SEQ ID NO: 23 (AAV5 genome). The present invention 
further provides an isolated nucleic acid that selectively or specifically hybridizes with an 
isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 23 
(AAV5 genome). By "selectively hybridizes" as used herein is meant a nucleic acid that 

30 hybridizes to one of the disclosed nucleic acids under sufficient stringency conditions 
without significant hybridization to a nucleic acid encoding an unrelated protein, and 
particularly, without detectably hybridizing to nucleic acids of AAV2. Thus, a nucleic acid 
that selectively hybridizes with a nucleic acid of the present invention will not selectively 
hybridize under stringent conditions with a nucleic acid encoding a different protein or the 

35 corresponding protein from a different serotype of the virus, and vice versa. A "specifically 
hybridizing" nucleic acid is one that hybridizes under stringent conditions to only a nucleic 
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5 acid found in AAV5. Therefore, nucleic acids for use, for example, as primers and probes to 
detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 
that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 
probes for further hybridization or for amplification methods (e.g., polymerase chain 
reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 

10 can be designed that selectively hybridizes with both AAV5 and a gene of interest carried 
within the AAV5 vector (i.e., a chimeric nucleic acid). 

A nucleic acid that selectively hybridizes to any portion of the AAV5 genome is 
contemplated herein. Therefore, a nucleic acid that selectively hybridizes to AAV5 can be of 
longer length than the AAV5 genome, it can be about the same length as the AAV5 genome 

15 or it can be shorter than the AAV5 genome. The length of the nucleic acid is limited on the 
shorter end of the size range only by its specificity for hybridization to AAV5, i.e., once it is 
too short, typically less than about 5 to 7 nucleotides in length, it will no longer bind 
specifically to AAV5, but rather will hybridize to numerous background nucleic acids. 
Additionally contemplated by this invention is a nucleic acid that has a portion that 

20 specifically hybridizes to AAV5 and a portion that specifically hybridizes to a gene of 
interest inserted within AAV5 . 

The present invention further provides an isolated nucleic acid encoding an adeno- 
associated virus 5 Rep protein. The AAV5 Rep proteins are encoded by open reading frame 
(ORF) 1 of the AAV5 genome. Examples of the AAV5 Rep genes are shown in the nucleic 

25 acid set forth in SEQ ID NO: 23, and include nucleic acids consisting essentially of the 

nucleotide sequences set forth in SEQ ID NOS: 32 (Rep52), 33 (Rep78), 35 (Rep40), and 37 
(Rep68), and nucleic acids comprising the nucleotide sequences set forth in SEQ ID NOS: 
32, 33, 35, and 37. However, the present invention contemplates that the Rep nucleic acid 
can include any one, two, three, or four of the four Rep proteins, in any order, in such a 

30 nucleic acid. Furthermore, minor modifications are contemplated in the nucleic acid, such as 
silent mutations in the coding sequences, mutations that make neutral or conservative 
changes in the encoded amino acid sequence, and mutations in regulatory regions that do not 
disrupt the expression of the gene. Examples of other minor modifications are known in the 
art. Further modifications can be made in the nucleic acid, such as to disrupt or alter 

35 expression of one or more of the Rep proteins in order to, for example, determine the effect 
of such a disruption; such as to mutate one or more of the Rep proteins to determine the 
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5 resulting effect, etc. However, in general, a modified nucleic acid encoding a Rep protein 
will have at least about 85%, about 90%, about 93%, about 95%, about 98% or 100% 
homology to the Rep nucleic sequences described herein e.g., SEQ ID NOS: , 11, 13 and 15 
32, 33, 35 and 37, and the Rep polypeptide encoded therein will have overall about 93%, 
about 95%, about 98%, about 99% or 1 00% homology with the amino acid sequence 

10 described herein, e.g., SEQ ID NOS: 24, 25, 34 and 36. Percent homology is determined by 
the techniques described herein. 

The present invention also provides an isolated nucleic acid that selectively or 
specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
' set forth in SEQ ID NOS: 32, 33, 35 and 37, and an isolated nucleic acid that selectively 

1 5 hybridizes with a nucleic acid comprising the nucleotide sequence set forth in SEQ ID NOS: 
32, 33, 35 and 37. "Selectively hybridizing" and "stringency of hybridization" is defined 
elsewhere herein. 

As described above, provided is the nucleic acid encoding a Rep 40 protein and, in 
particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 

20 NO: 35, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 
SEQ ID NO: 35, and a nucleic acid encoding the adeno-associated virus 5 protein having the 
amino acid sequence set forth in SEQ ID NO: 34. The present invention also provides the 
nucleic acid encoding a Rep 52 protein, and in particular an isolated nucleic acid comprising 
the nucleotide sequence set forth in SEQ ID NO: 32, an isolated nucleic acid consisting 

25 essentially of the nucleotide sequence set forth in SEQ ID NO: 32, and a nucleic acid 

encoding the adeno-associated virus 5 Rep protein having the amino acid sequence set forth 
in SEQ ID NO: 24. The present invention further provides the nucleic acid encoding a Rep 
68 protein and, in particular an isolated nucleic acid comprising the nucleotide sequence set 
forth in SEQ ID NO: 37, an isolated nucleic acid consisting essentially of the nucleotide 

30 sequence set forth in SEQ ID NO: 37, and a nucleic acid encoding the adeno-associated 
virus 5 protein having the amino acid sequence set forth in SEQ ID NO: 36. And, further, 
provided is the nucleic acid encoding a Rep 78 protein, and in particular an isolated nucleic 
acid comprising the nucleotide sequence set forth in SEQ ID NO: 33, an isolated nucleic 
acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 33, and a 

35 nucleic acid encoding the adeno-associated virus 5 Rep protein having the amino acid 
sequence set forth in SEQ ID NO: 25. As described elsewhere herein, these nucleic acids 
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can have minor modifications, including silent nucleotide substitutions, mutations causing 
conservative amino acid substitutions in the encoded proteins, and mutations in control 
regions that do not or minimally affect the encoded amino acid sequence. 

The present invention further provides a nucleic acid encoding the entire AAV5 
Capsid polypeptide. Furthermore, provided is a nucleic acid encoding each of the three 
AAV5 coat proteins, VP1, VP2, and VP3. Thus, provided is a nucleic acid encoding AAV5 
VP1, a nucleic acid encoding AAV5 VP2, and a nucleic acid encoding AAV5 VP3. Thus, 
provided is a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 26 
(VP1); a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 27(VP2), 
and a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 28 (VP3). 
The present invention also specifically provides a nucleic acid comprising SEQ ID NO: 29 
(VP1 gene); a nucleic acid comprising SEQ ID NO: 30 (VP2 gene); and a nucleic acid 
comprising SEQ ID NO: 31 (VP3 gene). The present invention also specifically provides a 
nucleic acid consisting essentially of SEQ ID NO: 29 (VP1 gene), a nucleic acid consisting 
essentially of SEQ ID NO: 30 (VP2 gene), and a nucleic acid consisting essentially of SEQ 
ID NO: 31 (VP3 gene). Minor modifications in the nucleotide sequences encoding the 
capsid, or coat, proteins are contemplated, as described above for other AAV5 nucleic acids. 
However, in general, a modified nucleic acid encoding a capsid protein will have at least 
about 85%, about 90%, about 93%, about 95%, about 98% or 100% homology to the capsid 
nucleic sequences described herein e.g., SEQ ID NOS : 29, 30 and 3 1 , and the capsid 
polypeptide encoded therein will have overall about 93%, about 95%, about 98%, about 
99% or 100% homology with the amino acid sequence described herein, e.g., SEQ ID NOS : 
26, 27, and 28. Nucleic acids that selectively hybridize with the nucleic acids of SEQ ID 
NOS: 29, 30, and 31 under the conditions described above are also provided. 

Provided is an isolated AAV5 Rep protein. An AAV5 Rep polypeptide is encoded 
by ORF1 of AAV5. The present invention also provides each individual AAV5 Rep protein. 
Thus provided is AAV5 Rep 40 (e.g., SEQ ID NO: 34), or a unique fragment thereof. 
Provided is AAV5 Rep 52 (e.g., SEQ ID NO: 24), or a unique fragment thereof. Provided is 
AAV5 Rep 68 (e.g., SEQ ID NO: 36), or a unique fragment thereof. Provided is an example 
of AAV5 Rep 78 (e.g., SEQ ID NO: 25), or a unique fragment thereof. By "unique fragment 
thereof is meant any smaller polypeptide fragment encoded by an AAV5 rep gene that is of 
sufficient length to be found only in the Rep polypeptide. Substitutions and modifications of 
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5 the amino acid sequence can be made as described above and, further, can include protein 
processing modifications, such as glycosylation, to the polypeptide. 

The present invention further provides an AAV5 Capsid polypeptide or a unique 
fragment thereof. AAV5 capsid polypeptide is encoded by ORF 2 of AAV5. The present 
invention further provides the individual AAV5 capsid proteins, VP1, VP2 and VP3 or 

10 unique fragments thereof. Thus, provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ID NO: 26 (VP1). The present invention additionally provides an 
isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 27 (VP2). The 
present invention also provides an isolated polypeptide having the amino acid sequence set 
forth in SEQ ID NO: 28 (VP3). By "unique fragment thereof is meant any smaller 

15 polypeptide fragment encoded by any AAV5 capsid gene that is of sufficient length to be 
found only in the AAV5 capsid protein. Substitutions and modifications of the amino acid 
sequence can be made as described above and, further, can include protein processing 
modifications, such as glycosylation, to the polypeptide. However, an AAV5 Capsid 
polypeptide including all three coat proteins will have greater than about 56% overall 

20 homology to the polypeptide encoded by the nucleotides set forth in SEQ ID NOS: 26, 27, 
or 28. The protein can have about 65%, about 70%, about 75%, about 80%, about 85%, 
about 90%, 93%, 95%, 97% or even 100% homology to the amino acid sequence encoded 
by the nucleotides set forth in SEQ ID NOS: 26, 27 or 28. An AAV5 VP1 polypeptide can 
have at least about 58%, about 60%, about 70%, about 80%, about 90%, 93%, 95%, 97% or 

25 about 100% homology to the amino acid sequence set forth in SEQ ID NO: 26. An AAV5 
VP2 polypeptide can have at least about 58%, about 60%, about 70%, about 80%, about 
90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set forth in SEQ 
ID NO: 27. An AAV5 VP3 polypeptide can have at least about 60%, about 70%, about 
80%, about 90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set 

30 forth in SEQ ID NO: 28. 

The AAV ITRs in the vector for the herein described delivery methods can be AAV5 
ITRs (SEQ ID NOS: 41 and 42). Furthermore, the AAV ITRs in the vector for the herein 
described nucleic acid delivery methods can also comprise AAV1, AAV2, AAV3, AAV4, or 
AAV6 inverted terminal repeats. 
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5 Compositions and methods for making BAAV vectors 

Compositions and methods for making and using BAAV vectors have been 
previously described in U.S. Patent Application No. 60/526786, filed December 4, 2003, 
and in International Patent Application No. PCT/US04/40825, filed December 6, 2004, 
which are hereby incorporated herein by reference for this teaching. 

10 Provided is a recombinant bovine adeno-associated virus (BAAV). This virus has 

one or more of the characteristics described below. The compositions of the present 
invention do not include wild-type BAAV. The methods of the present invention can use 
either wild-type BAAV or recombinant B AAV-based delivery. 

Provided are novel BAAV particles, recombinant BAAV vectors and recombinant 

1 5 BAAV virions. An BAAV particle is a viral particle comprising an BAAV capsid protein. A 
recombinant BAAV vector is a nucleic acid construct that comprises at least one unique 
nucleic acid of BAAV. A recombinant BAAV virion is a particle containing a recombinant 
BAAV vector, wherin the particle can be either an BAAV particle as described herein or a 
non-BAAV particle. Alternatively, the recombinant BAAV virion is an BAAV particle 

20 containing a recombinant vector, wherein the vector can be either an BAAV vector as 

described herein or a non-BAAV vector. These vectors, particles, virions, nucleic acids and 

polypeptides are described below. 

Provided is the nucleotide sequence of the BAAV genome and vectors and particles 
derived therefrom. Specifically, provided is a nucleic acid vector comprising a pair of 

25 BAAV inverted terminal repeats (TTRs) and a promoter between the inverted terminal 
repeats. The rep proteins of AAV5 and BAAV will bind to the BAAV ITR and it can 
function as an origin of replication for packaging of recombinant AAV particles. The 
minimum sequence necessary for this activity is the TRS site where Rep cleaves in order to 
replicate the virus. Minor modifications in an ITR are contemplated and are those that will 

30 not interfere with the hairpin structure formed by the ITR as described herein and known in 
the art. Furthermore, to be considered within the term e.g. it must retain the Rep binding site 
described herein. 

The D- region of the AAV2 ITR, a single stranded region of the ITR, inboard of the 
TRS site, has been shown to bind a factor which depending on its phosphorylation state 
35 correlates with the conversion of the AAV from a single stranded genome to a 

transcriptionally active form that allows for expression of the viral DNA. This region is 
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5 conserved between AAV2, 3, 4,and 6 but is divergent in AAV5 and BAAV (SEQ ID NO: 
59). The D+ region is the reverse complement of the D- region. 

The promoter can be any desired promoter, selected by known considerations, such 
as the level of expression of a nucleic acid functionally linked to the promoter and the cell 
type in which the vector is to be used. That is, the promoter can be tissue/cell-specific. 

10 Promoters can be prokaryotic, eukaryotic, fungal, nuclear, mitochondrial, viral or plant 

promoters. Promoters can be exogenous or endogenous to the cell type being transduced by 
the vector. Promoters can include, for example, bacterial promoters, known strong 
promoters such as S V40 or the inducible metallothionein promoter, or an AAV promoter, • 
such as an AAV p5 promoter. Additionally, chimeric regulatory promoters for targeted gene 

1 5 expression can be utilized. Examples of these regulatory systems, which are known in the 
art, include the tetracycline based regulatory system which utilizes the tet transactivator 
protein (tTA), a chimeric protein containing the VP 16 activation domain fused to the tet 
repressor of Escherichia coli, the IPTG based regulatory system, the CID based regulatory 
system, and the Ecdysone based regulatory system. Other promoters include promoters 

20 derived from actin genes, immunoglobulin genes, cytomegalovirus (CMV), adenovirus, 
bovine papilloma virus, adenoviral promoters, such as the adenoviral major late promoter, 
an inducible heat shock promoter, respiratory syncytial virus, Rous sarcomas virus (RSV), 
etc., specifically, the promoter can be AAV2 p5 promoter or AAV5 p5 promoter or BAAV 
p5 promoter. More specifically, the BAAV p5 promoter can be in about the same location in 

25 SEQ ID NO: 47 as the AAV2 p5 promoter, in the corresponding AAV2 published sequence. 
Additionally, the p5 promoter may be enhanced by nucleotides 1 - 1 73 of SEQ ID NO: 47. 
Furthermore, smaller fragments of p5 promoter that retain promoter activity can readily be 
determined by standard procedures including, for example, constructing a series of deletions 
in the p5 promoter, linking the deletion to a reporter gene, and deteimining whether the 

30 reporter gene is expressed, i.e., transcribed and/or translated. The promoter can be the 
promoter of any of the AAV serotypes, and can be the pl9 promoter (SEQ ID NO: 62) or 
the p40 promoter set forth in the sequence listing as SEQ ID NO: 63. 

It should be recognized that any errors in any of the nucleotide sequences disclosed 
herein can be corrected, for example, by using the hybridization procedure described below 

35 with various probes derived from the described sequences such that the coding sequence can 
be reisolated and resequenced. Rapid screening for point mutations can also be achieved 
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5 with the use of polymerase chain reaction single strand conformation polymorphism (PCR 
SSCP). The corresponding amino acid sequence can then be corrected accordingly. 

The BAAV-derived vector can include any normally occurring BAAV nucleic acid 
sequences in addition to an ITR and promoter. The BAAV-derived vector can also include 
sequences that are at least 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 

1 0 the BAAV nucleic acids set forth herein. Examples of vector constructs are provided below. 

The present vector or BAAV particle or recombinant BAAV virion can utilize any 
unique fragment of these present BAAV nucleic acids, including the BAAV nucleic acids 
set forth in SEQ ID NOS: 47, 48, 50, 52, 54, 56 and 58-63. To be unique, the fragment must 
be of sufficient size to distinguish it from other known sequences, most readily determined 

1 5 by comparing any nucleic acid fragment to the nucleotide sequences of nucleic acids in 
computer databases, such as GenBank. Such comparative searches are standard in the art. 
Typically, a unique fragment useful as a primer or probe will be at least about 8 or 10, 
preferable at least 20 or 25 nucleotides in length, depending upon the specific nucleotide 
content of the sequence. Additionally, fragments can be, for example, at least about 30, 40, 

20 50, 75, 1 00, 200 or 500 nucleotides in length and can encode polypeptides or be probes. The 
nucleic acid can be single or double stranded, depending upon the purpose for which it is 
intended. Where desired, the nucleic acid can be RNA 

The present invention further provides a BAAV capsid protein to contain the vector. 
In particular, provided is not only a polypeptide comprising all three BAAV coat proteins, 

25 i.e., VP1, VP2 and VP3, but also a polypeptide comprising each BAAV coat protein 
individually, SEQ ID NOS: 53, 55, and 57, respectively. Thus, an BAAV particle 
comprising an BAAV capsid protein comprises at least one BAAV coat protein VP1, VP2 
or VP3. A BAAV particle comprising an BAAV capsid protein can be utilized to deliver a 
nucleic acid vector to a cell, tissue or subject. For example, the herein described BAAV 

30 vectors can be encapsidated in an AAV5 capsid-derived particle and utilized in a gene 
delivery method. Furthermore, other viral nucleic acids can be encapsidated in the BAAV 
particle and utilized in such delivery methods. For example, an AAV1-8 or AAAV vector 
(e.g. AAV1-8 or AAAV ITR and nucleic acid of interest) can be encapsidated in an BAAV 
particle and administered Furthermore, a BAAV chimeric capsid incorporating both AAV1- 

35 8 or AAAV capsid and BAAV capsid sequences can be generated, by standard cloning 
methods, selecting regions from the known sequences of each protein as desired. For 
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5 example, particularly antigenic regions of the BAAV capsid protein can be replaced with the 
corresponding region of the BAAV capsid protein. In addition to chimeric capsids 
incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3-8, and 
AAV5 capsid sequences can be generated, by standard cloning methods, selecting regions 
from the known sequences of each protein as desired. Alternatively a chimeric capsid can be 

10 made by the addition of a plasmid that expresses AAV1-8 capsid proteins at a ratio with the 
BAAV capsid expression plasmid that allows only a few capsid proteins to be incorpated 
into the BAAV particle. Thus, for example, a chimeric particle may be constructed that 
contains 6 AAV2 capsid proteins and 54 BAAV capsid proteins if the complete capsid 
contains 60 capsid proteins. 

1 5 The capsids can also be modified to alter their specific tropism by genetically 

altering the capsid to encode a specific ligand to a cell surface receptor. Alternatively, the 
capsid can be chemically modified by conjugating a ligand to a cell surface receptor. By 
genetically or chemically altering the capsids, the tropism can be modified to direct BAAV 
to a particular cell or population of cells. The capsids can also be altered immunologically 

20 by conjugating the capsid to an antibody that recognizes a specific protein on the target cell 

or population of cells. 

It has been recently reported that insertion of foreign epitopes (RGD motif; LH 

receptor targeting epitope) in certain regions of AAV2 capsid can redirect viral tropism. 

However, AAV2 naturally infects a wide variety of cell types and complete retargeting of 
25 rAAV2 would be difficult to achieve. Provided are two regions in the capsid of BAAV that 

are on the virus surface and could tolerate substitution. These two regions are aa 257-264 

(GSSNASDT, SEQ ID NO:67) and aa 444-457 (TTSGGTLNQGNSAT, SEQ ID NO:68). 

Other regions of the BAAV capsid could also accommodate the substitution of amino acids 

that would allow for epitope presentation on the surface of the virus. All of these regions 
30 would have in common 1) Surface exposure 2) able to support a substitution of sequence to 

insert the epitope 3) still allow for capsid assembly. 

Because of the symmetry of the AAV particles, a substitution in one subunit of the 

capsid will appear multiple times on the capsid surface. For example the capsid is made of 

approximately 55 VP3 proteins. Therefore an epitope incorporated in the VP3 protein could 
35 be expressed 55 times on the surface of each particle increasing the likelihood of the epitope 

forming a stable interaction with its target In some cases this may be too high of a ligand 
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5 density for functional binding or this high density of epitope may interfere with capsid 
fonnation. The epitope density could be lowered by introducing another plasmid into the 
packaging system for production of recombinant particles and the ratio between the 
packaging plasmid with the modified VP3 protein and the wt VP3 protein altered to balance 
the epitope density on the virus surface. 

10 Epitopes could be incorporated into the virus capsid for the purpose of 1) altering the 

tropism of the virus 2) blocking an immune response direct at the virus 3) developing a host 
immune response to the epitope for the purpose of vaccination. 

Examples of epitopes that could be added to BAAV capsids include but are not 

limited to: 

15 LH receptor binding epitope 

RGD integrin binding epitope 
CD13 binding epitope NGRAHA SEQ ID NO:69 
The Retanef polyprotein vaccine candidate for HIV-1 
single chain antibody fragments directed against tumor cells 
20 Endothelial cell binding epitope SIGYPLP SEQ ID NO:70 

serpin receptor ligand, KFNKPFVFLI SEQ ID NO:71 
protective B-cell epitope hemagglutinin (HA) 91-108 from influenza HA 
NDV B-cell immunodominant epitope (IDE) spanning residues 447 to 455 
Major immunogenic epitope for parvovirus B19 ( NISLDNPLENP S SLFDLVARIK, 
25 SEQ ID NO:72) that can elicit protective antibody titers. 

The capsids can also be assembled into empty particles by expression in mammalian, 
bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
VP3 and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 
BAAV particle comprising BAAV capsid proteins and also full particles. 
30 The herein described recombinant BAAV nucleic acid derived vector can be 

encapsidated in an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle or an AAV6 or 
AAV7 or an AAV8 or AAAV particle, a portion of any of these capsids, or a chimeric 
capsid particle as described above, by standard methods using the appropriate capsid 
35 proteins in the encapsidation process, as long as the nucleic acid vector fits within the size 
limitation of the particle utilized. The encapsidation process itself is standard in the art The 
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5 B AAV replication machinery, i.e. the rep initiator proteins and other functions required for 
replication, can be utilized to produce the BAAV genome that can be packaged in an 
AAV 1-8 or AAAV capsid. 

The recombinant BAAV virion containing a vector can also be produced by 
recombinant methods utilizing multiple plasmids. In one example, the BAAV rep nucleic 

1 0 acid would be cloned into one plasmid, the BAAV ITR nucleic acid would be cloned into 
another plasmid and the AAV1-8 capsid nucleic acid would be cloned on another plasmid. 
These plasmids would then be introduced into cells. The cells that were efficiently 
transduced by all three plasmids, would exhibit specific integration as well as the ability to 
produce BAAV recombinant virus. Additionally, two plasmids could be used where the 

1 5 BAAV rep nucleic acid would be cloned into one plasmid and the BAAV ITR and BAAV 
capsid would be cloned into another plasmid. These plasmids would then be introduced into 
cells. The cells that were efficiently transduced by both plasmids, would exhibit specific 
integration as well as the ability to produce BAAV recombinant virus. 

An BAAV capsid polypeptide encoding the entire VP1, VP2, and VP3 polypeptide 

20 can overall have greater than 56% homology to the polypeptide having the amino acid 
sequence encoded by nucleotides in SEQ ID NOS: 52, 54 and 56. The capsid protein can 
have about 70% homology, about 75% homology, 80% homology, 85% homology, 90% 
homology, 95% homology, 98% homology, 99% homology, or even 100% homology to the 
protein having the amino acid sequence encoded by the nucleotides set forth in SEQ ID 

25 NOS: 52, 54 and 56. The percent homology used to identify proteins herein, can be based on 
a nucleotide-by-nucleotide comparison or more preferable is based on a computerized 
algorithm as described herein. Variations in the amino acid sequence of the BAAV capsid 
protein are contemplated herein, as long as the resulting particle comprising an BAAV 
capsid protein remains antigenically or immunologically distinct from AAV1-8 or AAAV 

30 capsid, as can be routinely determined by standard methods. Specifically, for example, 
ELBA and Western blots can be used to determine whether a viral particle is antigenically 
or immunologically distinct from AAV2 or the other serotypes. Furthermore, the BAAV 
particle preferably retains tissue tropism distinction from other AAVs, such as that 
exemplified in the examples herein. A BAAV chimeric particle comprising at least one 

35 BAAV coat protein may have a different tissue tropism from that of an BAAV particle 
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5 consisting only of BAAV coat proteins, but is still distinct from the tropism of an AAV2 
particle. 

The invention further provides a recombinant BAAV virion, comprising a BAAV 
particle containing, i.e., encapsidating, a vector comprising a pair of BAAV inverted 
terminal repeats. The recombinant vector can further comprise a BAAV Rep-encoding 
1 0 nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
nucleic acid inserted between the inverted terminal repeats. 

The invention further contemplates chimeric recombinant ITRs that contain a rep 
binding site and a TRS site recognized by that Rep protein. By "Rep protein" is meant all 
four of the Rep proteins, Rep 40, Rep 78, Rep 52, Rep 68. Alternatively, "Rep protein" 
1 5 could be one or more of the Rep proteins described herein. One example of a chimeric ITR 
would consist of an BAAV D region (SEQ ID NO: 59), an BAAV TRS site (SEQ ID NO: 
60), an AAV2 hairpin and an AAV2 Rep binding site. Another example would be a BAAV 
D region, an BAAV TRS site, an AAV3 hairpin and an AAV3 Rep binding site. In these 
chimeric ITRs, the D region can be from AAV1-8 or AAAV. The hairpin can be derived 
20 from AAV 1-8 or AAAV. The binding site can be derived from any of AAV1-8 or AAAV. 
Preferably, the D region and the TRS are from the same serotype. 

The chimeric ITRs can be combined with BAAV Rep protein and any of the AAV 
serotype capsids to obtain recombinant virion. For example, recombinant virion can be 
produced by a BAAV D region, an BAAV TRS site, an AAV2 hairpin, an AAV2 binding 
25 site, BAAV Rep protein and AAV1 capsid. This recombinant virion would possess the 
cellular tropism conferred by the AAV1 capsid protein and would possess the efficient 
replication conferred by the BAAV Rep. 

Other examples of the ITR Rep protein and Capsids that will produce recombinant 
virus are provided in the list below but not limited to : 
30 BAAV ITR + BAAV Rep + BAAV Cap=virus 

AAV5 ITR + BAAV Rep + BAAV Cap=virus 

AAV5 ITR + BAAV Rep + AAV1 Cap=virus 

AAV5 ITR + BAAV Rep + AAV2 Cap=virus 

AAV5 ITR + BAAV Rep + AAV3 Cap=virus 
35 AAV5 ITR + BAAV Rep + AAV4 Cap=virus 

AAV5 ITR + BAAV Rep + AAV5 Cap=virus 
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5 AAV5 ITR + BAAV Rep + AAV6 Cap=virus 

AAV5 ITR + BAAV Rep + AAV7 Cap=virus 

AAV5 ITR + BAAV Rep + AAV8 Cap=virus 

BAAV ITR + AAV5 Rep + BAAV Cap=virus 

BAAV ITR + AAV5 Rep + AAV1 Cap=virus 
10 BAAV ITR + AAV5 Rep + AAV2 Cap=vims 

BAAV ITR + AAV5 Rep + AAV3 Cap=virus 

BAAV ITR + AAV5 Rep + AAV4 Cap=virus 

BAAV ITR + AAV5 Rep + AAV5 Cap=virus 

BAAV ITR + AAV5 Rep + AAV6 Cap=virus 
1 5 BAAV ITR + AAV5 Rep + AAV7 Cap=virus 

BAAV ITR + AAV5 Rep + AAV8 Cap=virus 

AAV 5 ITR + AAV5 Rep + BAAV Cap=vims 

AAV1 ITR + AAV1 Rep + BAAV Cap=virus 

AAV2 ITR + AAV2 Rep + BAAV Cap=virus 
20 AAV3 ITR + AAV3 Rep + BAAV Cap=virus 

AAV4 ITR + AAV4 Rep + BAAV Cap=virus 

AAV5 ITR + AAV5 Rep + BAAV Cap=virus 

AAV6 ITR + AAV6 Rep + BAAV Cap=virus 

AAV7 ITR + AAV7 Rep + BAAV Cap=virus 
25 AAV8 ITR + AAV8 Rep + BAAV Cap=virus 

In any of the constructs described herein, inclusion of a promoter is preferred. As 
used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 
BAAV VP1, BAAV VP2, BAAV VP3, combinations thereof, functional fragments of any 
of VP1, VP2 or VP3, or chimeric capsids as described herein. The ITRs of the constructs 
30 described herein, can be chimeric recombinant ITRs as described elsewhere in the 
application. 

Conjugates of recombinant or wild-type BAAV virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified BAAV can be 
used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 
35 to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 

molecule. Also contemplated are virosomes that contain BAAV structural proteins (BAAV 
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5 capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

Also provided by this invention are conjugates that utilize the BAAV capsid or a 
unique region of the BAAV capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 
to introduce DNA into cells. For example, the BAAV VP3 protein or fragment thereof, can 

1 0 be conjugated to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected 
using the targeting ability of the VP3 capsid protein to achieve the desired tissue tropism, 
specific to BAAV. BAAV VP 1 and VP2 proteins can also be utilized to introduce DNA or 
other molecules into cells. By further incorporating the Rep protein and the AAV TRS into 
the DNA-containing conjugate, cells can be transduced and targeted integration can be 

15 achieved. For example, if BAAV specific targeted integration is desired, a conjugate 

composed of the BAAV VP3 capsid, BAAV rep or a fragment of BAAV rep, BAAV TRS, 
the rep binding site, the heterologous DNA of interest, and a lipid, can be utilized to achieve 
BAAV specific tropism and BAAV specific targeted integration in the genome. 

Further provided by this invention are chimeric viruses where BAAV can be 

20 combined with herpes virus, baculovirus or other viruses to achieve a desired tropism 

associated with another virus. For example, the BAAV ITRs could be inserted in the herpes 
virus and cells could be infected. Post-infection, the ITRs of BAAV could be acted on by 
BAAV rep provided in the system or in a separate vehicle to rescue BAAV from the 
genome. Therefore, the cellular tropism of the herpes simplex virus can be combined with 

25 BAAV rep mediated targeted integration. Other viruses that could be utilized to construct 
chimeric viruses include lentivirus, retrovirus, pseudotyped retroviral vectors, and 
adenoviral vectors. 

The present invention further provides isolated nucleic acids of BAAV. For 
example, provided is an isolated nucleic acid comprising the nucleotide sequence set forth 

30 in SEQ ID NO: 47 (BAAV genome). This nucleic acid, or portions thereof; can be inserted 
into vectors, such as plasmids, yeast artificial chromosomes, or other viral vector (particle), 
if desired, by standard cloning methods. The present invention also provides an isolated 
nucleic acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 47. 
The nucleotides of SEQ ID NO: 47 can have minor modifications and still be contemplated 

35 by the present invention. For example, modifications that do not alter the amino acid 

encoded by any given codon (such as by modification of the third, ' Vobble," position in a 
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5 codon) can readily be made, and such alterations are known in the art. Furthermore, 

modifications that cause a resulting neutral (conserved) amino acid substitution of a similar 
amino acid can be made in a coding region of the genome. Additionally, modifications as 
described herein for the BAAV components, such as the ITRs, the p5 promoter, etc. are 
contemplated in this invention. Furthermore, modifications to regions of SEQ ID NO 47 

1 0 other than in the ITR, TRS, Rep binding site and hairpin are likely to be tolerated without 
serious impact on the function of the nucleic acid as a recombinant vector. 

The present invention additionally provides an isolated nucleic acid that selectively 
hybridizes with any nucleic acid disclosed herein, including the entire BAAV genome and 
any unique fragment thereof, including the Rep and capsid encoding sequences (e.g. SEQ ID 

15 NOS: 47, 48, 50, 52, 54, 56, 58, 59, 60, 61, 62, 63). Specifically, the nucleic acid can 

selectively or specifically hybridize to an isolated nucleic acid consisting of the nucleotide 
sequence set forth in SEQ ID NO: 47 (BAAV genome). The present invention further 
provides an isolated nucleic acid that selectively or specifically hybridizes with an isolated 
nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO: 47 (BAAV 

20 genome). By "selectively hybridizes" as used herein is meant a nucleic acid that hybridizes 
to one of the disclosed nucleic acids under sufficient stringency conditions without 
significant hybridization to a nucleic acid encoding an unrelated protein, and particularly, 
without detectably hybridizing to nucleic acids of AAV2. Thus, a nucleic acid that 
selectively hybridizes with a nucleic acid of the present invention will not selectively 

25 hybridize under stringent conditions with a nucleic acid encoding a different protein or the 
corresponding protein from a different serotype of the virus, and vice versa. A "specifically 
hybridizing" nucleic acid is one that hybridizes under stringent conditions to only a nucleic 
acid found in BAAV. Therefore, nucleic acids for use, for example, as primers and probes to 
detect or amplify the target nucleic acids are contemplated herein. Nucleic acid fragments 

30 that selectively hybridize to any given nucleic acid can be used, e.g., as primers and or 
probes for further hybridization or for amplification methods (e.g., polymerase chain 
reaction (PCR), ligase chain reaction (LCR)). Additionally, for example, a primer or probe 
can be designed that selectively hybridizes with both BAAV and a gene of interest carried 
within the BAAV vector (i.e., a chimeric nucleic acid). 

35 A nucleic acid that selectively hybridizes to any portion of the BAAV genome is 

contemplated herein. Therefore, a nucleic acid that selectively hybridizes to BAAV can be 
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5 of longer length than the BAAV genome, it can be about the same length as the B AAV 
genome or it can be shorter than the BAAV genome. The length of the nucleic acid is 
limited on the shorter end of the size range only by its specificity for hybridization to 
BAAV, i.e., once it is too short, typically less than about 5 to 7 nucleotides in length, it will 
no longer bind specifically to BAAV, but rather will hybridize to numerous background 

1 0 nucleic acids. Additionally contemplated by this invention is a nucleic acid that has a 

portion that specifically hybridizes to BAAV and a portion that specifically hybridizes to a 

gene of interest inserted within BAAV. 

The present invention further provides an isolated nucleic acid encoding a bovine 
adeno-associated virus Rep protein. The BAAV Rep proteins are encoded by open reading 

15 frame (ORF) 1 of the BAAV genome. Examples of the BAAV Rep genes are shown in the 
nucleic acid set forth in SEQ ID NO: 47, and include nucleic acids consisting essentially of 
the nucleotide sequences set forth in SEQ ID NOS: 48 (rep78), 4(rep52) and nucleic acids 
comprising the nucleotide sequences set forth in SEQ ID NOS: 48 and 50. However, the 
present invention contemplates that the Rep nucleic acid can include any one, two, three, or 

20 four of the four Rep proteins, in any order, in such a nucleic acid. Furthermore, minor 
modifications are contemplated in the nucleic acid, such as silent mutations in the coding 
sequences, mutations that make neutral or conservative changes in the encoded amino acid 
sequence, and mutations in regulatory regions that do not disrupt the expression of the gene. 
Examples of other minor modifications are known in the art. Further modifications can be 

25 made in the nucleic acid, such as to disrupt or alter expression of one or more of the Rep 
proteins in order to, for example, determine the effect of such a disruption; such as to mutate 
one or more of the Rep proteins to determine the resulting effect, etc. However, in general, a 
modified nucleic acid encoding a Rep protein will have at least about 85%, about 90%, 
about 93%, about 95%, about 98% or 100% homology to the Rep nucleic sequences 

30 described herein e.g., SEQ ID NOS: 48 and 50, and the Rep polypeptide encoded therein 
will have overall about 93%, about 95%, about 98%, about 99% or 100% homology with 
the amino acid sequence described herein, e.g., SEQ ID NOS: 49 and 51. Percent homology 
is determined by the techniques described herein. 

The present invention also provides an isolated nucleic acid that selectively or 

35 specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
set forth in SEQ ID NOS: 48 and 50, and an isolated nucleic acid that selectively hybridizes 
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5 with a nucleic acid comprising the nucleotide sequence set forth in SEQ ED NOS: 48 and 50. 
"Selectively hybridizing" and "stringency of hybridization" is defined elsewhere herein. 

As described above, provided is the nucleic acid encoding a Rep 78 protein and, in 
particular an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO: 48, an isolated nucleic acid consisting essentially of the nucleotide sequence set forth in 

10 SEQ ID NO: 48, and a nucleic acid encoding the bovine adeno-associated virus protein 
having the amino acid sequence set forth in SEQ ID NO: 49. The present invention also 
provides the nucleic acid encoding a Rep 52 protein, and in particular an isolated nucleic 
acid comprising the nucleotide sequence set forth in SEQ ID NO: 50, an isolated nucleic 
acid consisting essentially of the nucleotide sequence set forth in SEQ ID NO: 50, and a 

15 nucleic acid encoding the bovine adeno-associated virus Rep 52 protein having the amino 
acid sequence set forth in SEQ ID NO: 51. As described elsewhere herein, these nucleic 
acids can have minor modifications, including silent nucleotide substitutions, mutations 
causing conservative amino acid substitutions in the encoded proteins, and mutations in 
control regions that do not or minimally affect the encoded amino acid sequence. 

20 The present invention further provides a nucleic acid encoding the entire B AAV 

Capsid polypeptide. Furthermore, provided is a nucleic acid encoding each of the three 
BAAV coat proteins, VP1, VP2, and VP3. Thus, provided is a nucleic acid encoding BAAV 
VP1, a nucleic acid encoding BAAV VP2, and a nucleic acid encoding BAAV VP3. Thus, 
provided is a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 53 

25 (VP1); a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 55 (VP2), 
and a nucleic acid encoding the amino acid sequence set forth in SEQ ID NO: 57 (VP3). 
The present invention also specifically provides a nucleic acid comprising SEQ ID NO: 52 
(VP1 gene); a nucleic acid comprising SEQ ID NO: 54 (VP2 gene); and a nucleic acid 
comprising SEQ ID NO: 56 (VP3 gene). The present invention also specifically provides a 

30 nucleic acid consisting essentially of SEQ ID NO: 52 (VP1 gene), a nucleic acid consisting 
essentially of SEQ ED NO: 54 (VP2 gene), and a nucleic acid consisting essentially of SEQ 
ID NO: 56 (VP3 gene). Minor modifications in the nucleotide sequences encoding the 
capsid, or coat, proteins are contemplated, as described above for other BAAV nucleic 
acids. However, in general, a modified nucleic acid encoding a capsid protein will have at 

35 least about 85%, about 90%, about 93%, about 95%, about 98% or 100% homology to the 
capsid nucleic sequences described herein e.g., SEQ ID NOS: 52, 54 and 56, and the capsid 
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5 polypeptide encoded therein will have overall about 93%, about 95%, about 98%, about 
99% or 100% homology with the amino acid sequence described herein, e.g., SEQ ID NOS: 
53, 55 and 57. Nucleic acids that selectively hybridize with the nucleic acids of SEQ ID 
NOS : 52, 54 and 56 under the conditions described above are also provided. 

Provided is an isolated BAAV Rep protein. An BAAV Rep polypeptide is encoded 

10 by ORF1 of BAAV. The present invention also provides each individual BAAV Rep 
protein. Thus provided is BAAV Rep 52 (e.g., SEQ DD NO: 50), or a unique fragment 
thereof. Provided is BAAV Rep 78 (e.g., SEQ ID NO: 48), or a unique fragment thereof. By 
"unique fragment thereof 51 is meant any smaller polypeptide fragment encoded by an BAAV 
rep gene that is of sufficient length to be found only in the Rep polypeptide. Substitutions 

1 5 and modifications of the. amino acid sequence can be made as described above and, further, 
can include protein processing modifications, such as glycosylation, to the polypeptide. 

The present invention further provides a BAAV Capsid polypeptide or a unique 
fragment thereof. BAAV capsid polypeptide is encoded by ORF 2 of BAAV. The present 
invention further provides the individual BAAV capsid proteins, VP1, VP2 and VP3 or 

20 unique fragments thereof. Thus, provided is an isolated polypeptide having the amino acid 
sequence set forth in SEQ ID NO:52 (VP1). The present invention additionally provides an 
isolated polypeptide having the amino acid sequence set forth in SEQ ID NO: 54 (VP2). The 
present invention also provides an isolated polypeptide having the amino acid sequence set 
forth in SEQ ID NO:56 (VP3). By "unique fragment thereof is meant any smaller 

25 polypeptide fragment encoded by any BAAV capsid gene that is of sufficient length to be 
found only in the BAAV capsid protein. Substitutions and modifications of the amino acid 
sequence can be made as described above and, further, can include protein processing 
modifications, such as glycosylation, to the polypeptide. However, an BAAV Capsid 
polypeptide including all three coat proteins will have greater than about 56% overall 

30 homology to the polypeptide encoded by the nucleotides set forth in SEQ ID NOS: 52, 54 or 
56. The protein can have about 65%, about 70%, about 75%, about 80%, about 85%, about 
90%, 93%, 95%, 97% or even 100% homology to the amino acid sequence encoded by the 
nucleotides set forth in SEQ ID NOS: 52, 54 or 56. An BAAV VP1 polypeptide can have at 
least about 58%, about 60%, about 70%, about 80%, about 90%, 93%, 95%, 97% or about 

35 100% homology to the amino acid sequence set forth in SEQ ID NO: 53. An BAAV VP2 
polypeptide can have at least about 58%, about 60%, about 70%, about 80%, about 90%, 
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93%, 95%, 97% or about 100% homology to the amino acid sequence set forth in SEQ ID 
NO: 55. An BAAV VP3 polypeptide can have at least about 60%, about 70%, about 80%, 
about 90%, 93%, 95%, 97% or about 100% homology to the amino acid sequence set forth 
in SEQ ID NO: 57. 

The present invention also provides a method of producing the BAAV virus by 
transducing a cell with the nucleic acid encoding the virus. 

The present method further provides a method of delivering an exogenous 
(heterologous) nucleic acid to a cell comprising aciministering to the cell an BAAV particle 
containing a vector comprising the nucleic acid inserted between a pair of AAV inverted 
terminal repeats, thereby delivering the nucleic acid to the cell. 

The AAV ITRs in the vector for the herein described delivery methods can be AAV 
ITRs (SEQ ID NOS: 58). Furthermore, the AAV ITRs in the vector for the herein described 
nucleic acid delivery methods can also comprise AAV 1-8 or AAAV inverted terminal 
repeats. 

Compositions and methods for making AAV7 vectors 

Compositions and methods for making and using AAV7 vectors have been 
previously described in Gao GP, et al. Proc Natl Acad Sci USA. 2002 Sep 3;99(18):1 1854- 
9; U.S. Patent Application 2003/0228282; and International Patent Application No. 
PCT/US02/33630, which are hereby incorporated by reference herein for the teaching of 
compositions and method for making and using AAV7 virions, vectors, and particles. 

Provided is a recombinant adeno-associated virus-7 (AAV7). This virus has one or 
more of the characteristics described below. The compositions of the present invention do 
not include wild-type AAV7. The methods of the present invention can use either wild-type 
AAV7 or recombinant AAV7-based delivery. 

Provided are AAV7 particles, recombinant AAV7 vectors and recombinant AAV7 
virions. An AAV7 particle is a viral particle comprising an AAV7 capsid protein. A 
recombinant AAV7 vector is a nucleic acid construct that comprises at least one unique 
nucleic acid of AAV7. A recombinant AAV7 virion is a particle containing a recombinant 
AAV7 vector, wherein the particle can be either an AAV7 particle as described herein or a 
non-AAV7 particle. Alternatively, the recombinant AAV7 virion is an AAV7 particle 
containing a recombinant vector, wherein the vector can be either an AAV7 vector as 



54 



WO 2006/029196 PCT7US2005/031837 

5 described herein or a non-AAV7 vector. These vectors, particles, virions, nucleic acids and 

polypeptides are described below. 

The AAV7-derived vector can include any normally occurring AAV7 nucleic acid 
sequences. The AAV7-derived vector can also include sequences that are at least 60%, 70%, 
80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the AAV7 nucleic acids set forth 

1 0 herein. Examples of vector constructs are provided below. 

The present vector or AAV7 particle or recombinant AAV7 virion can utilize any 
unique fragment of the present AAV7 nucleic acids, including the AAV7 nucleic acids set 
forth in SEQ ID NO:64. Fragments can be, for example, at least about 30, 40, 50, 75, 100, 
200 or 500 nucleotides in length. The nucleic acid can be single or double stranded, 

1 5 depending upon the purpose for which it is intended. 

The present invention further provides an AAV7 capsid protein to contain the vector, 
hi particular, provided is a polypeptide comprising AAV7 capsid protein, SEQ ID NO:66. 
An AAV7 particle comprising an AAV7 capsid protein can be utilized to deliver a nucleic 
acid vector to a cell, tissue or subject. For example, the herein described AAV7 vectors can 

20 be encapsidated in an AAV5 capsid-derived particle and utilized in a gene delivery method. 
Furthermore, other viral nucleic acids can be encapsidated in the AAV7 particle and utilized 
in such delivery methods. For example, an AAV1-6, 8, BAAV or AAAV vector (e.g. 
AAV1-6, 8, BAAV or AAAV ITR and nucleic acid of interest) can be encapsidated in an 
AAV7 particle and administered. Furthermore, a AAV7 chimeric capsid incorporating 

25 AAV1-6, 8, BAAV or AAAV capsid, and AAV7 capsid sequences can be generated, by 
standard cloning methods, selecting regions from the known sequences of each protein as 
desired. For example, particularly antigenic regions of the AAV2 capsid protein can be 
replaced with the corresponding region of the AAV7 capsid protein. In addition to chimeric 
capsids incorporating AAV2 capsid sequences, chimeric capsids incorporating AAV1, 3-6, 

30 8, BAAV and AAV5 capsid sequences can be generated, by standard cloning methods, 
selecting regions from the known sequences of each protein as desired. Alternatively a 
chimeric capsid can be made by the addition of a plasmid that expresses AAV1, 3-6, 8, 
BAAV or AAV5 capsid proteins at a ratio with the AAV7 capsid expression plasmid that 
allows only a few capsid proteins to be incorpated into the AAV7 particle. Thus, for 

35 example, a chimeric particle may be constructed that contains 6 AAV2 capsid proteins and 
54 AAV7 capsid proteins if the complete capsid contains 60 capsid proteins. 
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5 The capsids can also be assembled into empty particles by expression in mammalian, 

bacterial, fungal or insect cells. For example, AAV2 particles are known to be made from 
VP3 and VP2 capsid proteins in baculovirus. The same basic protocol can produce an empty 
AAV7 particle comprising AAV7 capsid proteins and also full particles. 

The herein described recombinant AAV7 nucleic acid derived vector can be 

1 0 encapsidated in an AAV particle. In particular, it can be encapsidated in an AAV1 particle, 
an AAV2 particle, an AAV3 particle, an AAV4 particle, an AAV5 particle, an AAV6, an 
AAV8, a BAAV particle or AAAV particle, a portion of any of these capsids, or a chimeric 
capsid particle as described above, by standard methods using the appropriate capsid 
proteins in the encapsidation process, as long as the nucleic acid vector fits within the size 

1 5 limitation of the particle utilized. The encapsidation process itself is standard in the art. The 
AAV7 replication machinery, i.e. the rep initiator proteins and other functions required for 
replication, can be utilized to produce the AAV7 genome that can be packaged in an AAV1- 

6, 8, BAAV or AAAV capsid. 

The recombinant AAV7 virion containing a vector can also be produced by 

20 recombinant methods utilizing multiple plasmids. In one example, the AAV7 rep nucleic 
acid would be cloned into one plasmid, the AAV2 ITR nucleic acid would be cloned into 
another plasmid and the AAV7 capsid nucleic acid would be cloned on another plasmid. 
These plasmids would then be introduced into cells. The cells that were efficiently 
transduced by all three plasmids, would exhibit specific integration as well as the ability to 

25 produce AAV7 recombinant virus. Additionally, two plasmids could be used where the 
AAV7 rep nucleic acid would be cloned into one plasmid and the AAV7 ITR and AAV7 
capsid would be cloned into another plasmid. These plasmids would then be introduced into 
cells. The cells that were efficiently transduced by both plasmids, would exhibit specific 
integration as well as the ability to produce AAV7 recombinant virus. 

30 An AAV7 capsid polypeptide encoding the entire VP 1 polypeptide can overall have 

greater than 56% homology to the polypeptide having the amino acid sequence encoded by 
nucleotides in SEQ ID NO:66. The capsid protein can have about 70% homology, about 
75% homology, 80% homology, 85% homology, 90% homology, 95% homology, 98% 
homology, 99% homology, or even 100% homology to the protein having the amino acid 

35 sequence encoded by the nucleotides set forth in SEQ ID NO:66. The percent homology 
used to identify proteins herein, can be based on a nucleotide-by-nucleotide comparison or 
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5 more preferable is based on a computerized algorithm as described herein. Variations in the 
amino acid sequence of the AAV7 capsid protein are contemplated herein, as long as the 
resulting particle comprising an AAV7 capsid protein remains antigenically or 
immunologically distinct from AAV1-6, 8, BAAV or AAAV capsid, as can be routinely 
determined by standard methods. Specifically, for example, EIISA and Western blots can 

1 0 be used to determine whether a viral particle is antigenically or immunologically distinct 
from AAV2 or the other serotypes. Furthermore, the AAV7 particle preferably retains 
tissue tropism distinction from other AAVs. An AAV7 chimeric particle comprising at least 
one AAV7 coat protein may have a different tissue tropism from that of an AAV7 particle 
consisting only of AAV7 coat proteins, but is still distinct from the tropism of an AAV2 

1 5 particle. 

The invention further provides a recombinant AAV7 virion, comprising an AAV7 
particle containing, i.e., encapsidating, a vector comprising a pair of AAV7 inverted 
terminal repeats. The recombinant vector can further comprise an AAV7 Rep-encoding 
nucleic acid. The vector encapsidated in the particle can further comprise an exogenous 
20 nucleic acid inserted between the inverted terminal repeats. 

For example, recombinant virion can be produced by a AAV2 ITR, AAV2 Rep 
protein and AAV7 capsid. This recombinant virion would possess the cellular tropism 
conferred by the AAV7 capsid protein and would possess the efficient replication conferred 
by the AAV2 Rep. 

25 Other examples of the ITR, Rep protein and Capsids that will produce recombinant 

virus are provided in the list below but not limited to : 

AAVS ITR + AAV7 Rep + AAV1 Cap=virus 

AAVS ITR + AAV7 Rep + AAV2 Cap=virus 

AAVS ITR + AAV7 Rep + AAV3 Cap-virus 
30 AAV5 ITR + AAV7 Rep + AAV4 Cap-virus 

AAVS ITR + AAV7 Rep + AAVS Cap=virus 

AAV5 ITR + AAV7 Rep + AAV6 Cap=virus 

AAVS ITR + AAV7 Rep + AAV7 Cap=virus 

AAV5 ITR + AAV7 Rep + AAV8 Cap=virus 
3 5 AAVS ITR + AAV7 Rep + BAAV Cap=virus 

AAV5 ITR + AAV7 Rep + AAAV C ap=virus 
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5 AAV1 ITR + AAVl Rep + AAV7 Cap=virus 

AAV2 ITR + AAV2 Rep + AAV7 Cap=virus 

AAV3 ITR + AAV3 Rep + AAV7 Cap=virus 

AAV4 ITR + AAV4 Rep + AAV7 Cap=virus 

AAV5 ITR + AAV5 Rep + AAV7 Cap=virus 
1 0 AAV6 ITR + AAV6 Rep + AAV7 Cap=virus 

AAV8 ITR + AAV8 Rep + AAV7 Cap=virus 

BAAV ITR + BAAV Rep + AAV7 Cap=virus 

AAAV ITR + AAAV Rep + AAV7 Cap=virus 

15 In any of the constructs described herein, inclusion of a promoter is preferred. As 

used in the constructs herein, unless otherwise specified, Cap (capsid) refers to any of 
AAV7 VP1, AAV7 VP2, AAV7 VP3, combinations thereof, functional fragments of any of 
VP1 VP2 or VP3, or chimeric capsids as described herein. The ITRs of the constructs 
described herein, can be chimeric recombinant ITRs as described elsewhere in the 

20 application. 

Conjugates of recombinant or wild-type AAV7 virions and nucleic acids or proteins 
can be used to deliver those molecules to a cell. For example, the purified AAV7 can be 
used as a vehicle for delivering DNA bound to the exterior of the virus. Examples of this are 
to conjugate the DNA to the virion by a bridge using poly L lysine or other charged 

25 molecule. Also contemplated are virosomes that contain AAV7 structural proteins (AAV7 
capsid proteins), lipids such as DOTAP, and nucleic acids that are complexed via charge 
interaction to introduce DNA into cells. 

Also provided by this invention are conjugates that utilize the AAV7 capsid or a 
unique region of the AAV7 capsid protein (e.g. VP1, VP2 or VP3 or combinations thereof) 

30 to introduce DNA into cells. By "unique" is meant any smaller polypeptide fragment 
encoded by any AAV7 capsid gene that is of sufficient length to be unique to the AAV7 
Capsid protein. For example, the AAV7 VP1 protein or fragment thereof, can be conjugated 
to a DNA on a plasmid that is conjugated to a lipid. Cells can be infected using the targeting 
ability of the VP1 capsid protein to achieve the desired tissue tropism, specific to AAV7. 

35 AAV7 VP1 proteins can also be utilized to introduce DNA or other molecules into cells. By 
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5 further incorporating an AAV Rep protein and an AAV TRS into the DNA-containing 
conjugate, cells can be transduced and targeted integration can be achieved. 

The present invention further provides isolated nucleic acids of AAV7. For example, 
provided is an isolated nucleic acid comprising the nucleotide sequence set forth in SEQ ID 
NO:64. This nucleic acid, or portions thereof, can be inserted into vectors, such as plasmids, 

10 yeast artificial chromosomes, or other viral vector (particle), if desired, by standard cloning 
methods. The present invention also provides an isolated nucleic acid consisting essentially 
of the nucleotide sequence set forth in SEQ ID NO:64. The nucleotides of SEQ ID NO:64 
can have minor modifications and still be contemplated by the present invention. For 
example, modifications that do not alter the amino acid encoded by any given codon (such 

15 as by modification of the third, "wobble," position in a codon) can readily be made, and 
such alterations are known in the art. Furthermore, modifications that cause a resulting 
neutral (conserved) amino acid substitution of a similar amino acid can be made in a coding 

region of the genome. 

The present invention also provides an isolated nucleic acid that selectively or 

20 specifically hybridizes with a nucleic acid consisting essentially of the nucleotide sequence 
set forth in SEQ ID NO:64, and an isolated nucleic acid that selectively hybridizes with a 
nucleic acid comprising the nucleotide sequence set forth in SEQ ID NO:64. "Selectively 
hybridizing" and "stringency of hybridization" is defined elsewhere herein. 

The present invention further provides an isolated nucleic acid encoding a AAV7 

25 Rep protein. The AAV7 Rep proteins are encoded by open reading frame (ORF) 1 of the 
AAV7 genome. Examples of the AAV7 Rep genes are shown in the nucleic acid set forth in 
nucleotides 334-2205 of SEQ ED NO:64, and include nucleic acids consisting essentially of 
the nucleotide sequences set forth in 334-2205 of SEQ ID NO:64 (rep78). Minor 
modifications are contemplated in the nucleic acid, such as silent mutations in the coding 

30 sequences, mutations that make neutral or conservative changes in the encoded amino acid 
sequence, and mutations in regulatory regions that do not disrupt the expression of the gene. 
Examples of other minor modifications are known in the art. Further modifications can be 
made in the nucleic acid, such as to disrupt or alter expression of one or more of the Rep 
proteins in order to, for example, determine the effect of such a disruption; such as to mutate 

35 one or more of the Rep proteins to determine the resulting effect, etc. However, in general, a 
modified nucleic acid encoding a Rep protein will have at least about 85%, about 90%, 
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5 about 93%, about 95%, about 98% or 100% homology to the Rep nucleic sequences 

described herein e.g., SEQ ID NOS:65, and the Rep polypeptide encoded therein will have 
overall about 93%, about 95%, about 98%, about 99% or 100% homology with the amino 
acid sequence described in SEQ ID NO:65. Percent homology is determined by the 
techniques described herein. 

1 0 The present invention further provides a nucleic acid encoding the entire AAV7 

Capsid polypeptide. Thus, provided is a nucleic acid encoding the amino acid sequence set 
forth in nucleotides 2222-4435 of SEQ ID NO:64 (VP1). Minor modifications in the 
nucleotide sequences encoding the capsid, or coat, proteins are contemplated, as described 
above for other AAV7 nucleic acids. However, in general, a modified nucleic acid encoding 

1 5 a capsid protein will have at least about 85%, about 90%, about 93%, about 95%, about 
98% or 100% homology to the capsid nucleic sequences described herein e.g., nucleotides 
2222-4435 of SEQ ID NO:64, and the capsid polypeptide encoded therein will have overall 
about 93%, about 95%, about 98%, about 99% or 100% homology with the amino acid 
sequence described herein, e.g., SEQ ID NO:66. 

20 AAV Vector Generation 

It is understood that as discussed herein the use of the terms "homology" and 
"identity" mean the same thing as similarity. Thus, for example, if the use of the word 
homology is used to refer to two non-natural sequences, it is understood that this is not 
necessarily indicating an evolutionary relationship between these two sequences, but rather 

25 is looking at the similarity or relatedness between their nucleic acid sequences. Many of the 
methods for determining homology between two evolutionarily related molecules are 
routinely applied to any two or more nucleic acids or proteins for the purpose of measuring 
sequence similarity regardless of whether they are evolutionarily related. 

In general, it is understood that one way to define any known variants and 

30 derivatives or those that might arise, of the disclosed nucleic acids and polypeptides herein, 
is through defining the variants and derivatives in terms of homology to specific known 
sequences. In general, variants of nucleic acids and polypeptides herein disclosed typically 
have^at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 
89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the stated sequence or the 

35 native sequence. Those of skill in the art readily understand how to determine the homology 
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5 of two polypeptides or nucleic acids. For example, the homology can be calculated after 
aligning the two sequences so that the homology is at its highest level. 

Another way of calculating homology can be performed by published algorithms. 
Optimal alignment of sequences for comparison may be conducted by the local homology 
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology 

10 alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search 
for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 
(1988), by computerized implementations of these algorithms (GAP, BESTFTT, FASTA, 
and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 
Science Dr., Madison, WI; the BLAST algorithm of Tatusova and Madden FEMS 

15 Microbiol. Lett. 174: 247-250 (1999) available from the National Center for Biotechnology 
Information (http://ww.ncbi.ri]m.nm.gov/blast/bl2seq^l2.html) ), or by inspection. 

The same types of homology can be obtained for nucleic acids by for example the 
algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. 
Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which 

20 are herein incorporated by reference for at least material related to nucleic acid alignment. It 
is understood that any of the methods typically can be used and that in certain instances the 
results of these various methods may differ, but the skilled artisan understands if identity is 
found with at least one of these methods, the sequences would be said to have the stated 
identity. 

25 For example, as used herein, a sequence recited as having a particular percent 

homology to another sequence refers to sequences that have the recited homology as 
calculated by any one or more of the calculation methods described above. For example, a 
first sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using the Zuker 

30 calculation method even if the first sequence does not have 80 percent homology to the 

second sequence as calculated by any of the other calculation methods. As another example, 
a first sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using both the 
Zuker calculation method and the Pearson and Lipman calculation method even if the first 

35 sequence does not have 80 percent homology to the second sequence as calculated by the 
Smith and Waterman calculation method, the Needleman and Wunsch calculation method, 
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5 the Jaeger calculation methods, or any of the other calculation methods. As yet another 
example, a first sequence has 80 percent homology, as defined herein, to a second sequence 
if the first sequence is calculated to have 80 percent homology to the second sequence using 
each of calculation melhods (although, in practice, the different calculation methods will 
often result in different calculated homology percentages). 

1 0 Stringency of hybridization is controlled by both temperature and salt concentration 

of either or both of the hybridization and washing steps. Typically, the stringency of 
hybridization to achieve selective hybridization involves hybridization in high ionic strength 
solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the 
melting temperature at which half of the molecules dissociate from their hybridization 

1 5 partners) followed by washing at a combination of temperature and salt concentration 
chosen so that the washing temperature is about 5°C to 20°C below the Tm. The 
temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a 
labeled nucleic acid of interest and then washed under conditions of different stringencies. 

20 Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA 

hybridizations. The washing temperatures can be used as described above to achieve 
selective stringency, as is known in the art. (Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York, 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987). A preferable stringent 

25 hybridization condition for a DNADNA hybridization can be at about 68°C (in aqueous 
solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization 
and washing, if desired, can be reduced accordingly as the degree of complementarity 
desired is decreased, and further, depending upon the G-C or A-T richness of any area 
wherein variability is searched for. Likewise, stringency of hybridization and washing, if 

30 desired, can be increased accordingly as homology desired is increased, and further, 

depending upon the G-C or A-T richness of any area wherein high homology is desired, all 
as known in the art. 

In vivo administration to a human subject or an animal model can be by any of many 
standard means for administering viruses, depending upon the target organ, tissue or cell. 
3 5 Virus particles can be administered orally, parenteraUy (e.g., intravenously), by 

intramuscular injection, intrarectaUy, by direct tissue or organ injection, by intraperitoneal 
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5 injection, topically, transdermally, via aerosol delivery, via the mucosa or the like. Viral 
nucleic acids (non-encapsidated) can also be administered, e.g., as a complex with cationic 
liposomes, or encapsulated in anionic liposomes. The present compositions can include 
various amounts of the selected viral particle or non-encapsidated viral nucleic acid in 
combination with a pharmaceutical^ acceptable carrier and, in addition, if desired, may 

10 include other medicinal agents, pharmaceutical agents, carriers, adjuvants, diluents, etc. 
Parental administration, if used, is generally characterized by injection. Injectables can be 
prepared in conventional forms, either as liquid solutions or suspensions, solid forms 
suitable for solution or suspension in liquid prior to injection, or as emulsions. Dosages will 
depend upon the mode of administration, the disease or condition to be treated, and the 

1 5 individual subject's condition, but will be that dosage typical for and used in administration 
of other AAV vectors, such as AAV2 vectors. Often a single dose can be sufficient; 
however, the dose can be repeated if desirable. 

Administration of a recombinant AAV virion to the cell can be accomplished by any 
means, including simply contacting the particle, optionally contained in a desired liquid 

20 such as tissue culture medium, or a buffered saline solution, with the cells. The virion can 
be allowed to remain in contact with the cells for any desired length of time, and typically 

r ■ 

the virion is administered and allowed to remain indefinitely. For such in vitro methods, the 
virion can be administered to the cell by standard viral transduction methods, as known in 
the art and as exemplified herein. Titers of virus to administer can vary, particularly 

25 depending upon the cell type, but will be typical of that used for AAV transduction in 
general which is well known in the art. Additionally the titers used to transduce the 
particular cells in the present examples can be utilized. 

The cells that can be transduced by the present recombinant AAV virions can 
include any desired cell, such as the following cells and cells derived from the following 

30 tissues, human as well as other mammalian tissues, such as primate, horse, sheep, goat, pig, 
dog, rat, and mouse and avian species: Adipocytes, Adenocyte, Adrenal cortex, Amnion, 
Aorta, Ascites, Astrocyte, Bladder, Bone, Bone marrow, Brain, Breast, Bronchus, Cardiac 
muscle, Cecum, Cervix, Chorion, Cochlear, Colon, Conjunctiva, Connective tissue, Cornea, 
Dennis, Duodenum, Embryonic stem cells, Endometrium, Endothelium, Endothelial cells, 

35 Epithelial tissue, Epithelial cells, Epidermis, Esophagus, Eye, Fascia, Fibroblasts, Foreskin, 
Gastric, Glial cells, Glioblast, Gonad, Hepatic cells, Histocyte, Hair cells in the inner ear, 
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5 Ileum, Intestine, small Intestine, Jejunum, Keratinocytes, Kidney, Larynx, Leukocytes, 
Lipocyte, Liver, Lung, Lymph node, Lymphoblast, Lymphocytes, Macrophages, Mammary 
alveolar nodule, Mammary gland, Mastocyte, Maxilla, Melanocytes, Mesenchymal, 
Monocytes, Mouth, Myelin, Myoblasts Nervous tissue, Neuroblast, Neurons, Neuroglia, 
Osteoblasts, Osteogenic cells, Ovary, Palate, Pancreas, Papilloma, Peritoneum, Pituicytes, 

10 Pharynx, Placenta, Plasma cells, Pleura, Prostate, Rectum, Salivary gland, Skeletal muscle, 
Skin, Smooth muscle, Somatic, Spleen, Squamous, Stem cells, Stomach, Submandibular 
gland, Submaxillary gland, Synoviocytes, Testis, Thymus, Thyroid, Trabecular Trachea, 
Turbinate, Umbilical cord, Ureter, Uterus, and vestibular hair cells. 

Stringency of hybridization is controlled by both temperature and salt concentration 

1 5 of either or both of the hybridization and washing steps. Typically, the stringency of 

hybridization to achieve selective hybridization involves hybridization in high ionic strength 
solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the 
melting temperature at which half of the molecules dissociate from their hybridization 
partners) followed by washing at a combination of temperature and salt concentration 

20 chosen so that the washing temperature is about 5°C to 20°C below the Tm. The 
temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a 
labeled nucleic acid of interest and then washed under conditions of different stringencies. 
Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA 

25 hybridizations. The washing temperatures can be used as described above to achieve 
selective stringency, as is known in the art. (Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York, 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987). A preferable stringent 
hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous 

30 solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization 
and washing, if desired, can be reduced accordingly as the degree of complementarity 
desired is decreased, and further, depending upon the G-C or A-T richness of any area 
wherein variability is searched for. Likewise, stringency of hybridization and washing, if 
desired, can be increased accordingly as homology desired is increased, and further, 

3 5 depending upon the G-C or A-T richness of any area wherein high homology is desired, all 
as known in the art. 
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5 By the "suitability of an AAV vector for administration to a subject 5 ' is meant a 

determination of whether the AAV vector will elicit a neutralizing immune response upon 
administration to a particular subject. A vector that does not elicit a significant immune 
response is a potentially suitable vector, whereas a vector that elicits a significant, 
neutralizing immune response (e.g. at least 90%) is thus likely to be unsuitable for use in 

1 0 that subj ect. Significance of any detectable immune response is a standard parameter 
understood by the skilled artisan in the field. For example, one can incubate the subject's 
serum with the virus, then determine whether that virus retains its ability to transduce cells 
in culture. If such virus cannot transduce cells in culture, the vector likely has elicited a 

significant immune response. 

1 5 Alternatively, or additionally, one skilled in the art could determine whether or not 

AAV administration would be suitable for a particular cell type of a subject. For example, 
the artisan could culture muscle cells in vitro and transduce the cells with AAV in the 
presence or absence of the subject's serum. If there is a reduction in transduction efficiency, 
this could indicate the presence of a neutralizing antibody or other factors that may inhibit 

20 transduction. Normally, greater than 90% inhibition would have to be observed in order to 
rule out the use of AAV-5 as a vector. However, this limitation could be overcome by 
treating the subject with an immunosuppressant that could block the factors inhibiting 
transduction. 

EXAMPLES 

25 The following examples are put forth so as to provide those of ordinary skill in the 

art with a complete disclosure and description of how the compounds, compositions, 
articles, devices and/or methods claimed herein are made and evaluated, and are intended to 
be purely exemplary of the invention and are not intended to limit the scope of what the 
inventors regard as their invention. Efforts have been made to ensure accuracy with respect 

30 to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be 

accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or 
is at ambient temperature, and pressure is at or near atmospheric. 

Example! 

Previous research had demonstrated that Caco-2 and MDCK cells are model cell 
35 lines for the study of macromolecular transport via transcytosis. Furthermore these cell lines 
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5 have been used to demonstrate transcytosis of both viruses and proteins. Therefore, to test if 
AAV can spread through tissue by transcytosis, 2xl0 8 DNA resistant particles of 
recombinant AAV2 (rAAV2) AAV4, AAV5, AAV6, BAAV suspended in 50ul of medium 
were placed in the upper (apical) side of the transwell polycarbonate filter over a monolayer 
of cells each of the following cells Caco-2, MDCKI, MDCKH, Human primary airways 

10 epithelia cells (Airway), Human primary immortalized epithelial endometrial, Bovine brain 
primary endothelia cells (BBB), or HeLa. All cultures had TERs indicating the formation of 
tight junctions and polarized phenotype. After 3 hours of incubation the medium in the basal 
side of the transwell was collected and tested for the presence of transcytosed rAAV DNA. 
Viral DNA was extracted from 200ul of basal medium and quantified by qPCR. 

15 In these cell lines, transcytosis was observed with several AAV serotypes and appeared 

to be both serotype and tissue-specific (Fig. 1). Three hours after the addition of AAV to the 
apical surface of the cells, over 800,000 particles of AAV5 were present in the media on the 
basal lateral side of the trans-well insert of CaCo-2 cells, but not the MDCK, airway epithelia, 
endometrial, or BBB cells (Fig. 1 ). Similarly BAAV particles were detected in the media on the 

20 basal lateral side of the MDCK, airways epithelia, endometrial, and BBB cells but not the Caco- 
2 cells. Interestingly, AAV4 was detected in the basal lateral media of all cell types. No virus 
was detected in the basal lateral media when AAV2 was added to the apical surface in either 
cell type. AAV6 did not transcytose in any of cell types tested, and was not tested on airway 
epithelia or BBB. HeLa cells do not form barrier epithelia and were used as a control. 

25 Example 2 

Previous work has demonstrated that transcytosis is a temperature dependent process 
than can be inhibited at 4°C. Transcytosis can also be inhibited by the addition of agents that 
selectively fix the plasma membrane. Recently the addition of tannic acid, a mild fixative 
agent, to the basal lateral surface blocked the transcytosis of GPI-anchored proteins to the 

30 apical surface (Polishchuk R, Nat Cell Biol 2004. 6(4):297-307). Therefore the ability of 
this agent to block the transcytosis of AAV was tested. Treatment of the basal lateral 
surface of either Caco-2 or MDCK cells prior to virus addition to the apical surface blocked 
the accumulation of AAV5 or BAAV in the basal lateral media. Furthermore, quantification 
of the intracellular virus demonstrated inhibition of exocytosis by tannic acid treatment 

35 dramatically increase the amount of AAV DNA in the cell suggesting the viral particles 
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5 detected in the basal lateral media are the result of an intracellular transport process and not 

a paracellular route. 

Treatment of the basal lateral surface of Human primary airways epithelial cell 

(HAE) with tannic acid blocked the transcytosis of B AAV or AAV4 vector containing a 

GFP expression cassette from the apical surface to the basal lateral (Fig. 2). Furthermore 
10 transduction dramatically increased when assayed at 24 hrs post inoculation. In contrast no 

change was observed in AAV2 transduction, which did not demonstrate any transcytosis 

activity and has limited binding activity on HAE. 

Example 3 

■ 

To confirm the DNA detected in the basal lateral media was indeed extracted from 

1 5 intact virus, the material was tested for DNase resistance after treatment with heat, ionic 
detergent or protease. The addition of DNase alone or in combination with the ionic 
detergent deoxycholine had no effect on the viral DNA present in the media suggesting it 
was not free DNA or complexed in lipid vesicles. However, heating to 95°C prior to 
treatment with DNAase completely degraded the viral DNA present in the media. This 

20 profile is identical to that of the input AAV particles and suggests the viral DNA is still 
encapsulated. Titration of the DNase resistant virus in the basal lateral media on Cos cells 
gave a similar particle to infectivity ratio to the input AAV particles. 

While it would appear the AAV DNA detected in the basal lateral media is 
contained in intact particles, its presence on the basal lateral surface could be the result of 

25 lyses of the cells or disruption of the monolayer. Therefore the TER was carefully monitored 
throughout the course of these experiments and was not observed to decrease. To further 
confirm the integrity of the cell monolayer, mixing experiments were studied in which two 
viruses with differe^gene^assettes were added to the apical surface at the same time and 
three hours post addition the amount of each virus in the basal lateral media was quantified 

30 using QPCR specific for each cassette. Both BAAV and AAV5 were able to pass from the 
apical to the basal lateral surface of MDCK or Caco cells respectively but the AAV2 did 
not Therefore the presence of viral particles in the basal lateral media does not appear to be 
the result of a disruption in the cell monolayer. 

Taken together this data suggest that dependoviruses particles are capable of passing 

35 through barrier epithelia via transcytosis and the process is both serotype and cell type 
specific. 
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Example 4 

To further characterize the transcytosis activity observed with AAV5 and BAAV, 
transcytosis was quantified as both a time and concentration dependent event. After the 
addition of particles to the apical surface, samples were removed from the basal lateral 
media at different time points and the amount of virus was quantified by QPCR of the 
extracted DNA. Viral genomes could be detected as soon as 30 minutes after addition and 
steadily increased with time By 24 hrs, over 1/3 of the input recombinant AAV5, 
B AAVvirus added to Caco or MDCK cells respectively had been transported to the basal 
lateral surface. In contrast, none of the input AAV2 or adenovirus was detected on the basal 

lateral side after 24 hrs. 

If transcytosis is an activity used by AAV to spread through tissue, this finding 
would help explain the lack of transduction of barrier epithelia reported with some isolates 
of AAV. Primary human bronchial airway epithelia (HAE) are known to transport albumin 
from the apical to the basal lateral surface by receptor-mediated transcytosis in vivo. While 
the interaction of BAAV with primary HAE has not been investigated, AAV4, 5 are 
reported to bind to HAE, however, for AAV4, this interaction does not result in 
transduction. Because of the interaction of AAV4 with O-link sialic acid, it was proposed, 
and has been demonstrated, that mucins, which contained large amounts of O-linked sialic 
acid and are expressed on the apical surface of HAE, can block AAV4 transduction. 
Alternatively the lack of transduction could be the result of transcytosis of the virus through 
the tissue. 

To test this hypothesis, AAV2, 4, 5, BAAV were added to the apical surface of 
confluent monolayer cultures of primary human bronchial airway and transcytosis to the 
basal lateral surface was measured by QPCR after 3 hrs. All cultures had high TERs and 
expressed ciliated structures on their apical surface. Highly differentiated HAE cultures in 
contrast to immature cultures are resistant to transduction by adenoviral vectors due to a 
lack of integrin expression that is necessary for adenovirus entry. 

Of the 4 AAVs tested for transcytosis, AAV4 and BAAV were detected in the basal 
lateral media. No transport of AAV2 or AAV5 was detected. As a control, adenovirus also 
was tested for transcytosis activity in the HAE cultures, but no transport was detected. 
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5 Example 5 

Epithelial cells that line the genitourinary tract form an important epithelial barrier 
layer and can transport proteins by transcytosis. AAV2, 4, 5 or BAAV were therefore tested 
to determine for the ability to penetrate this barrier epithelial layer by transcytosis. A well- 
characterized model of endometrial cells has been reported by Kyo et al. Following addition 
10 of the 4 AAVs to the apical surface, BAAV and AAV4 could be detected in the basal lateral 
media when assayed at 3hrs post inoculation (Fig. 1). 

Example 6 

Most AAVs were identified originally as contaminants of laboratory stocks of 
adenovirus, thus our understanding of their natural biology, cell tropism, and knowledge the 

1 5 cellular components required for virus entry is limited. For AAVS, in addition to N-linked 
sialic acid, the platelet derived growth factor (PDGF) receptors were identified as protein 
receptors for AAVS pi Pasquale et al., Nat Med. 2003 Oct;9(10): 1306-12). This interaction 
was confirmed by modulation of PDGFR expression by transfection of expression plasmids, 
inhibitor treatment, or competition experiments with the extracellular domain of PDGFRa. 

20 Likewise AAV5 transduction could be blocked with sialolactosamine conjugates kaludov et 
al200L 

Previous research had demonstrated that transcytosis is actin dependent and occurs 
by a caviolin mediated pathway. Furthermore transcytosis can be blocked by treatment with 
tannic acid. Therefore to better characterize the transcytosis pathway utilized by AAVS in 
25 Caco cells the cells were treated with a panel of agents known to block either transcytosis in 
other systems or AAVS mediated transduction. It was noted that AAVS transcytosis could 
be inhibited by filipin and nocozodol as well as treatment with tannic acid. 

Caco cells, which actively transcytosis AAVS, are not reported to express PDGFR 
and are not transduced by AAV5. In agreement, competition experiments with sPDGFRa 
30 had little effect on AAVS transcytosis. Furthermore, competition experiments with 200 
ug/ml sialolactosamine or 200 ug/ml heparin did not inhibited AAV5 transcytosis. 

Both BSA and transferrin are reported to transcytosis through Caco cells via distinct 
receptor mediated pathways. However competition with either agent did not inhibit AAV5 
transcytosis suggesting the AAVS could use a distinct pathway. 
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5 In addition to confirming the intracellular nature of AAV5 transcytosis in Caco cells, the 
above experiments suggest that AAV5 transcytosis is occurring by a pathway independent 
of the one described for transduction. To confirm this Caco cells were stably transfected 
with PDGFRa and assayed for both transcytosis and transduction activity. Caco cells were 
not permissive for AAV5 transduction, however transduction dramatically increase 

1 0 following stable expression of PDGFRa. In contrast only a minor increase in transcytosis 
activity was detected in the Caco/PDGFRa cells. 

Throughout this application, various publications are referenced. The disclosures of 
these publications in their entireties are hereby incorporated by reference into this 
application in order to more fully describe the state of the art to which this invention 

1 5 pertains. 

It will be apparent to those skilled in the art that various modifications and variations 
can be made in the present invention without departing from the scope or spirit of the 
invention. Other embodiments of the invention will be apparent to those skilled in the art 
from consideration of the specification and practice of the invention disclosed herein. It is 
20 intended that the specification and examples be considered as exemplary only, with a true 
scope and spirit of the invention being indicated by the following claims. 
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CLAIMS 

What is claimed is: 

1 . A method of delivering a heterologous nucleic acid across an epithelial barrier 
comprising delivering to the epithelial barrier an AAV vector comprising the 
heterologous nucleic acid. 

2. The method of claim 1 , wherein the epithelial cells are in the gut, lung, genitourinary 

tract, kidney, blood vessels or brain. 

3. The method of claim 1, wherein the epithelial cells can be selected from a group 
consisting of bronchial, alveolar, tracheal or upper airway epithelial cells; absorptive 
enterocytes; endometrial or urinary epithelial cells; renal collecting duct or proximal 
tubule epithelial cells; cerebral microvascular endothelial cells; or Choroidal Plexus 
epithelial cells . 

4. A method of transcytosing epithelial cells of a human subject comprising 
administering to the subject an AAV vector comprising a heterologous nucleic acid. 

5 . The method of claim 4, wherein the epithelial cells ai;e selected from a group 
consisting of bronchial, alveolar, tracheal or upper airway epithelial cells; absorptive 
enterocytes; endometrial or urinary epithelial cells; renal collecting duct or proximal 
tubule epithelial eels; cerebral microvascular endothelial cells; or Choroidal Plexus 
epithelial cells. 

6. A method of delivering a heterologous nucleic acid across human airway epithelial 
cells, comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

7. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a BAAV vector 
comprising the nucleic acid. 

8 . A method of delivering a heterologous nucleic acid across human endometrial 
epithelial cells, comprising delivering to the cells a BAAV vector comprising the 
nucleic acid. 

9. A method of delivering a heterologous nucleic acid across human kidney epithelial 
cells, comprising delivering to the cells a BAAV vector comprising the nucleic acid. 

10. A method of delivering a heterologous nucleic acid across human enterocytes, 
comprising delivering to the cells a AAV5 vector comprising the nucleic acid. 

71 



WO 2006/029196 PCT/US2005/031837 

11. A method of delivering a heterologous nucleic acid across human airway epithelial 
cells, comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

12. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a AAV4 vector 
comprising the nucleic acid. 

13. A method of delivering a heterologous nucleic acid across human endometrial 
epithelial cells, comprising delivering to the cells a AAV4 vector comprising the 
nucleic acid. 

14. A method of delivering a heterologous nucleic acid across human kidney epithelial 
cells, comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

15. A method of delivering a heterologous nucleic acid across human enterocytes 
comprising delivering to the cells a AAV4 vector comprising the nucleic acid. 

16. A method of delivering a heterologous nucleic acid across human cerebral 
microvascular endothelial cells, comprising delivering to the cells a AAV7 vector 

comprising the nucleic acid. 

17. A method of delivering a heterologous nucleic acid across an epithelial barrier of the 
lung, comprising delivering to the lung a B AAV vector comprising the nucleic acid. 

18. The method of claiml7, wherein the epithelial barrier comprises human bronchial, 
alveolar, tracheal or upper airway epithelial cells. 

19. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
brain, comprising delivering to the brain a BAAV vector comprising the nucleic 
acid. 

20. The method of claiml9, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

21. A method of delivering a heterologous nucleic acid across the epithelial barrier of 
blood vessels into the muscle, comprising delivering to the blood stream a BAAV 
vector comprising the nucleic acid. 

22. The method of claim 2 1 , wherein the epithelial barrio comprises human vascular 
endothelial cells of the blood brain barrier. 



72 



WO 2006/029196 PCT7US2005/031837 

23. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
genitourinary tract, comprising delivering to the genitourinary tract a BAAV vector 
comprising the nucleic acid genitourinary tract. 

24. The method of claim 23, wherein the epithelial barrier comprises human endometrial 
or urinary epithelial cells. 

25. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
kidney, comprising delivering to the genitourinary tract a BAAV vector comprising 
the nucleic acid genitourinary tract. 

26. The method of claim 25, wherein the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. 

27. A method of transcytosing lung epithelial cells of a subject comprising contacting 
the lung epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. 

28. The method of claim 27, wherein the epithelial cells are human bronchial, tracheal, 
or upper airway epithelial cells. 

29. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with a BAAV vector comprising a 
heterologous nucleic acid. 

30. The method of claim 29, wherein the epithelial cells are human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

31. A method of transcytosing vascular epithelial cells of a subject comprising 
contacting the vascular epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. 

32. The method of claim 31, wherein the epithelial cells are human vascular endothelial 
cells of the blood brain barrier. 

33. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the genitourinary tract epithelial cells of the subject with a BAAV vector 
comprising a heterologous nucleic acid. 

34. The method of claim 33, wherein the epithelial cells are human endometrial or 
urinary tract epithelial cells. 
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35. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the kidney epithelial cells of the subject with a BAAV vector comprising 
a heterologous nucleic acid. 

36. The method of claim 35, wherein the epithelial cells are human renal collecting 
ducts or proximal tubules 

37. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
gut, comprising delivering to the gut an AAV5 vector comprising the nucleic acid. 

3 8 . The method of claim 3 7, wherein the epithelial barrier comprises human absorptive 
enterocytes. 

39. A method of transcytosing gut epithelial cells of a subject comprising contacting the 
gut epithelial cells of the subject with an AAV5 vector comprising a heterologous 
nucleic acid. 

40. The method of claim 39, wherein the epithelial cells are human absorptive 
enterocytes. 

41. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
gut, comprising delivering to the gut an AAV4 vector comprising the nucleic acid. 

42. The method of claim 4 1 , wherein the epithelial barrier comprises human absorptive 
enterocytes. 

43. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
lung, comprising delivering to the lung an AAV4 vector comprising the nucleic acid. 

44. The method of claim 43, wherein the epithelial barrier comprises human bronchial, 
tracheal, or upper airway epithelial cells. 

45. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
CNS, comprising delivering to the CNS an AAV4 vector comprising the nucleic 
acid. 

46. The method of claim 45, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

47. A method of delivering a heterologous nucleic acid across the epithelial banier of 
blood vessels into the muscle, comprising delivering to the blood stream an AAV4 
vector comprising the nucleic acid. 



74 



WO 2006/029196 PCT/US2005/031837 

48. The method of claim 47, wherein the epithelial barrier comprises human vascular 
endothelial cells of the blood brain barrier. 

49. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
genitourinary tract, comprising delivering to the genitourinary tract an AAV4 vector 
comprising the nucleic acid. 

50. The method of claim 49, wherein the epithelial barrier comprises human endometrial 
or urinary epithelial cells. 

51. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
kidneys, comprising delivering to the kidneys an AAV4 vector comprising the 
nucleic acid. 

52. The method of claim 5 1 , wherein the epithelial barrier comprises human renal 
collecting ducts or proximal tubules. 

53. A method of transcytosing lung epithelial cells of a subject comprising contacting 
the lung epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

54. The method of 53, wherein the epithelial cells are human bronchial, tracheal, or 
upper airway epithelial cells. 

55. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

56. The method of claim 55, wherein the epithelial cells are human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

57. A method of transcytosing vascular epithelial cells of a subject comprising 
contacting the vascular epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. 

58. The method of claim 57, wherein the epithelial cells are vascular endothelial cells of 
the blood brain barrier. 

59. A method of transcytosing genitourinary tract epithelial cells of a subject comprising 
contacting the genitourinary epithelial cells of the subject with an AAV4 vector 
comprising a heterologous nucleic acid. 
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60. The method of claim 59, wherein the epithelial cells are human endometrial or 
urinary epithelial cells. 

61 . A method of transcytosing kidney epithelial cells of a subject comprising contacting 
the kidney epithelial cells of the subject with an AAV4 vector comprising a 
heterologous nucleic acid. 

62. The method of claim 61 , wherein the epithelial cells are human renal collecting 
ducts or proximal tubules 

63. A method of transcytosing gut epithelial cells of a subject comprising contacting the 
CNS epithelial cells of the subject with an AAV4 vector comprising a heterologous 
nucleic acid. 

64. The method of claim 63, wherein the epithelial cells are human absorptive 
enterocytes. 

65. A method of delivering a heterologous nucleic acid across an epithelial barrier in the 
brain, comprising delivering to the brain a AAV7 vector comprising the nucleic acid. 

66. The method of claim 65, wherein the epithelial barrier comprises human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 

67. A method of transcytosing CNS epithelial cells of a subject comprising contacting 
the CNS epithelial cells of the subject with a AAV7 vector comprising a 
heterologous nucleic acid. 

68. The method of claim 67, wherein the epithelial cells are human cerebral 
microvascular endothelial cells or Choroidal Plexus epithelial cells of the blood 
brain barrier. 
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SEQUENCE LISTING 

<110> The Government of the united states of America, as represented by The 

Secretary, Department of Health and Human services, National institutes of 
Heal th 

chiorini, John A. 
DePasquale, Giovanni 

<120> TRANSOTOSIS OF ADENO-ASSOCIATED VIRUSES 

<130> 14014/0427P1 

<140> TBA 

<141> 2005-09-08 

<150> 60/607,854 
<151> 2004-09-08 

<160> 72 

<170> FastSEQ for Windows version 4.0 

<210> 1 
<211> 4768 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<221> misc_feature 
<222> (0)...(0) 
<223> n=a,t,c, or g 

<221> variation 
<222> (0)..-(0) 
<223> xaa=any amino acid 

<400> 1 

ttggccactc cctctatgcg cgctcgctca ctcactcggc cctggagacc aaaggtctcc 60 

agactgccgg cctctggccg gcagggccga gtgagtgagc gagcgcgcat agagggagtg 120 

gccaactcca tcatctaggt ttgcccactg acgtcaatgt gacgtcctag ggttagggag 180 

gtccctgtat tagcagtcac gtgagtgtcg tatttcgcgg agcgtagcgg agcgcatacc 240 

aagctgccac gtcacagcca cgtggtccgt ttgcgacagt ttgcgacacc atgtggtcag 300 

gagggtatat aaccgcgagt gagccagcga ggagctccat tttgcccgcg aattttgaac 360 

gagcagcagc catgccgggg ttctacgaga tcgtgctgaa ggtgcccagc gacctggacg 420 

agcacctgcc cggcatttct gactcttttg tgagctgggt ggccgagaag gaatgggagc 480 

tgccgccgga ttctgacatg gacttgaatc tgattgagca ggcacccctg accgtggccg 540 

aaaagctgca acgcgagttc ctggtcgagt ggcgccgcgt gagtaaggcc ccggaggccc 600 

tcttctttgt ccagttcgag aagggggaca gctacttcca cctgcacatc ctggtggaga 660 

ccgtgggcgt caaatccatg gtggtgggcc gctacgtgag ccagattaaa gagaagctgg 720 

tgacccgcat ctaccgcggg gtcgagccgc agcttccgaa ctggttcgcg gtgaccaaga 780 

cgcgtaatgg cgccggaggc gggaacaagg tggtggacga ctgctacatc cccaactacc 840 

tgctccccaa gacccagccc gagctccagt gggcgtggac taacatggac cagtatataa 900 

gcgcctgttt gaatctcgcg gagcgtaaac ggctggtggc gcagcatctg acgcacgtgt 960 

cgcagacgca ggagcagaac aaggaaaacc agaaccccaa ttctgacgcg ccggtcatca 1020 

ggtcaaaaac ctccgccagg tacatggagc tggtcgggtg gctggtggac cgcgggatca 1080 

cgtcagaaaa gcaatggatc caggaggacc aggcgtccta catctccttc aacgccgcct 1140 

ccaactcgcg gtcacaaatc aaggccgcgc tggacaatgc ctccaaaatc atgagcctga 1200 

caaagacggc tccggactac ctggtgggcc agaacccgcc ggaggacatt tccagcaacc 1260 
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gcatctaccg 
tgggctgggc 
cgacgggtaa 
tgaactggac 
gggaggaggg 
gcaaggtgcg 
tcgtcacctc 
accaacaacc 
actttggcaa 
tgaccgaggt 
ccaatgacgc 
cgtcagacgc 
acgtgggtat 
tggacatttg 
ctcaacccgt 
tcatggggag 
atgactgtga 
gattggctag 
gcccctaaac 
ggttacaaat 
gacgcggcag 
ccctacctca 
ccgtttgggg 
cttggtctgg 
tccccccagc 
aagaagctcg 
tccggagcca 
ggsggacaag 
tggtctgagg 
aaccacctnt 
accccctggg 
cagcgactca 
aacatccagg 
accagcacgg 
ggtcaagagg 
tactgtggac 
tgcctggagt 
agttttgaga 
atgaaccctc 
ctgaatgccg 
tttaaaaaga 
aatcaaaact 
agcactctgg 
cctgcggaca 
aacacggcca 
aacgccaccg 
ctgccgaccg 
agagacattt 
cacccctcac 
aagaacaccc 
ttcattactc 
gagcggtcca 
tctctgttgt 
cgctacctca 
gttgaacttt 
taagcagcgg 
tctggcaaac 
ggccctggag 
agcgagcgcg 

<210> 2 
<211> 623 
<212> PRT 



aatcctcgag 
gcaaaagaag 
aaccaacatc 
caatgagaac 
caagatgacg 
cgtggaccaa 
caacaccaac 
actccaggac 
ggtcaccaag 
gactcacgag 
agatataagt 
ggaagctccg 
gaatctgatg 
cttcacgcac 
gtctgtcgtc 
ggcgcccgag 
catggaacaa 
aggacaacct 
ccaaggcaaa 
acctcggacc 
ccctcgagca 
agtacaacca 
gcaacctcgg 
ttgagcaagc 
agcccgactc 
ttttcgaaga 
tgtctgatga 
gtgccgatgg 
gccacgtcac 
acaagcgact 
gatactttga 
tcaacaacaa 
tcaaggaggt 
ttcagatctt 
gcagcctgcc 
tggtgaccgg 
actttccttc 
aggtgccttt 
tcatcgacca 
ggactgccac 
actggctgcc 
acaagatccc 
acggaagatg 
gcaagttcag 
ccgtacccgg 
atacggacat 
tggacagact 
actaccaggg 
cgctgattgg 
cggtacctgc 
agtacagcac 
aacgctggaa 
gggctcccga 
cccaccacct 
ggtctccgtg 
cctgcggcgc 
catgatgatg 
accaaaggtc 
catagaggga 



atgaacgggt 
ttcgggaaga 
gcggaagcca 
tttccgttca 
gccaaggtcg 
aagtgcaagt 
atgtgcgcgg 
cggatgttca 
caggaagtca 
ttttacgtca 
gagcccaagc 
gtggactacg 
ctttttccct 
ggggtcatgg 
agaaagcgga 
gtggcctgct 
taaatgactc 
ctctgaaggc 
tcaacaacat 
cggcaacgga 
cgacaaggcc 
cgccgacgcg 
cagagcagtc 
gggtgagacg 
ctccacgggt 
cgaaactgga 
cagtgagatg 
agtgggtaat 
gaccaccagc 
cggagagagc 
cttcaaccgc 
ctggggcatg 
cacgacgtcg 
tgcggactcg 
tccttttccc 
caacacttcg 
gcagatgctg 
ccactcgatg 
gtacctgtgg 
caccaacttt 
cgggccttca 
tgccaccggg 
gagtgccctg 
caacagccag 
gactctgatc 
gtggggcaac 
gacagccttg 
tcccatttgg 
tgggtttggg 
gaatcctgca 
tggccaggtg 
ccccgaggtc 
tgcggctggg 
gtaataacct 
tccttcttat 
ttgcgcttcg 
gagttggcca 
tccagactgc 
gtggccaa 



acgatccgca 
ggaacaccat 
tcgcccacgc 
acgattgcgt 
tagagagcgc 
catcggccca 
tcatcgacgg 
agttcgagct 
aagacttttt 
gaaagggtgg 
gggcctgtcc 
cggacaggta 
gccggcaatg 
actgtgccga 
cgtatcagaa 
cggcctgcga 
aaaccagata 
gttcgagagt 
caggacaacg 
ctcgacaagg 
tacgaccagc 
gagttccagc 
ttccaggcca 
gctcctggaa 
atcggcaaaa 
gcaggcgacg 
cgtgcagcag 
gcctcgggtg 
accagaacct 
ctgcagtcca 
ttccactgcc 
cgacccaaag 
aacggcgaga 
tcgtacgaac 
aacgacgtct 
cagcaacaga 
cggactggca 
tacgcgcaca 
ggactgcaat 
accaagctgc 
atcaagcagc 
tcagacagtc 
acccccggac 
ctcatctttg 
ttcacctctg 
ctacctggcg 
ggagccgtgc 
gccaagattc 
ctgaaacacc 
acgaccttca 
tcggtgcaga 
cagtttacct 
aaatacactg 
gttaatcaat 
cttatctcgt 
cggtttacaa 
ctccctctat 
cggcctctgg 



gtacgcggcc 
ctggctcttt 
cgtgcccttc 
cgacaagatg 
caaggccatc 
gatcgaccca 
aaactcgacc 
caccaagcgc 
ccggtgggcg 
agctagaaag 
gtcagttgcg 
ccaaaacaaa 
cgagagaatg 
gtgcttcccc 
actgtgtccg 
actggccaat 
tgactgacgg 
ggtgggcgct 
ctcggggtct 
gggaacccgt 
agctcaaggc 
agcggcttca 
aaaagagggt 
agaagagacc 
aaggcaagca 
gaccccctga 
ctggcggagc 
attggcattg 
gggtcttgcc 
acacctacaa 
acttctcacc 
ccatgcgggt 
caacggtggc 
tgccgtacgt 
ttatggtgcc 
ctgacagaaa 
acaactttga 
gccagagcct 
cgaccaccac 
ggcctaccaa 
agggcttctc 
tcatcaaata 
ctccaatggc 
cggggcctaa 
aggaggagct 
gtgaccagag 
ctggaatggt 
ctcataccga 
cgcctcctca 
gctctactcc 
ttgactggga 
ccaactacgg 
agcctagggc 
aaaccggttt 
ttccatggct 
ctgccggtta 
gcgcgctcgc 
ccggcagggc 



tccgtcttcc 
gggccggcca 
tacggctgcg 
gtgatctggt 
ctgggcggaa 
actcccgtga 
accttcgagc 
ctggagcacg 
tcagatcacg 
aggcccgccc 
cagccatcga 
tgttctcgtc 
aatcagaatg 
gtgtcagaat 
attcatcaca 
gtggacttgg 
ttaccttcca 
gcaacctgga 
tgtgcttccg 
caacgcagcg 
cggtgacaac 
gggcgacaca 
tcttgaacct 
gttgattgaa 
gccggctaaa 
gggatcaact 
tgcagtcgag 
cgattccacc 
cacctacaac 
cggattctcc 
acgtgactgg 
caaaatcttc 
taataacctt 
gatggatgcg 
ccagtacggc 
tgccttctac 
aattacgtac 
ggaccggctg 
cggaaccacc 
cttttccaac 
aaagactgcc 
cgagacgcac 
cacggctgga 
acagaacggc 
ggcagccacc 
caacagcaac 
ctggcaaaac 
tggacacttt 
aatttttatc 
ggtaaactcc 
gatccagaag 
acagcaaaac 
tatcggtacc 
attcgtttca 
actgcgtaca 
atcagtaact 
tcactcactc 
cgagtgagtg 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4768 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 2 






Glu lie Val 




Met 


Pro 


Gly 


Phe Tyr 
5 


Leu 


1 
Glu 


His 


Leu 


Pro Gly 


lie Ser Asp 


Ser 






20 


25 


Lys 


Glu 


Trp 


Glu Leu 


Pro Pro Asp 


Ser 




35 




40 




Glu 


Gin 


Ala 


Pro Leu 


Thr Val Ala 


Glu 




50 






55 


Ala 


Val 


Glu 


Trp 


Arq Arg 

W WW PI • w w J 


Val Ser Lys 


65 




70 


Phe 


Gin 


Phe 


GlU 


Lys Gly 


Asp Ser Tyr 








85 




Thr 


val 


Gly 


Val Lys 


ser Met val 


val 






100 




105 


Lys 


Glu 


Lys 


Leu Val 


Thr Arg lie 


Tyr 




115 




120 




pro 


Asn 


Trp 


Phe Ala 


Val Thr Lys 


Thr 




130 




135 


lie 


Asn 


Lys 


Val 


Val Asp 


Asp Cys Tyr 


145 






150 




Thr 


Gin 


Pro 


Glu Leu 


Gin Trp Ala 


Trp 








165 




Ser 


Ala 


cys 


Leu Asn 


Leu Ala Glu 


Arg 






180 




185 


Leu 


Thr 


His 


val ser 


Gin Thr Gin 


Glu 






195 




-» r\r\ 

200 




pro 


Asn 


Ser 


asd a! a 


Pro Val lie 


Arg 




210 




215 




Met 


Glu 


Leu 


Val Gly 


Trp Leu val 


Asp 


225 






230 




Gin 


Trp 


He 


Gin Glu 


Asp Gin Ala 

• 


Ser 






245 




Ala 


Ser 


Asn 


Ser 


Arg ser 


Gin lie Lys 








260 


265 


He 


Met 


ser 


Leu Thr 


Lys Thr Ala 


Pro 






275 




280 




Pro 


Pro 


Glu 


Asp lie 


Ser ser Asn 


Arg 




290 




295 


Ala 


Asn 


Gly 


Tyr 


Asp Pro 


Gin Tyr Ala 


305 


310 


Thr 


Gin 


Lys 


Lys 


Phe Gly 
325 


Lys Arg Asn 


Thr 


Thr 


Gly 


Lys Thr 


Asn lie Ala 


Glu 






340 




345 


Phe 


Tyr 


Gly 


cys Val 


Asn Trp Thr 


Asn 




355 


360 




cys 


val 


Asp 


Lys Met 


val lie Trp 


Trp 


370 


375 




Lys 


val 


Val 


Glu ser 


Ala Lys Ala 


He 


385 








390 


Ala 


val 


Asp 


Gin 


Lys cys 


Lys Ser Ser 






405 






He 


Val 


Thr 


ser Asn 


Thr Asn Met 


Cys 








420 




425 


Thr 


Thr 


Phe 


Glu His 


Gin Gin Pro 


Leu 






435 




440 





10 15 
Phe val ser Trp val Ala Glu 

30 

Asp Met Asp Leu Asn Leu lie 

45 

Lys Leu Gin Arg Glu Phe Leu 
60 

Pro Glu Ala Leu Phe Phe Val 
75 80 
His Leu His lie Leu val Glu 
90 95 
Gly Arg Tyr Val Ser Gin He 

110 

Arg Gly Val Glu Pro Gin Leu 

125 

Arg Asn Gly Ala Gly Gly Gly 
140 

Pro Asn Tyr Leu Leu Pro Lys 
155 160 
Thr Asn Met Asp Gin Tyr lie 
170 175 
Lys Arg Leu Val Ala Gin His 

190 

Gin Asn Lys Glu Asn Gin Asn 

205 

Ser Lys Thr Ser Ala Arg Tyr 
220 

Arg Gly lie Thr Ser Glu Lys 
235 240 
Tyr lie Ser Phe Asn Ala Ala 
250 255 
Ala Leu Asp Asn Ala ser Lys 

270 

Asp Tyr Leu val Gly Gin Asn 

285 

lie Tyr Arg lie Leu Glu Met 
300 

Ser Val Phe Leu Gly Trp Ala 
315 320 
lie Trp Leu Phe Gly Pro Ala 
330 335 
Ala lie Ala His Ala Val Pro 

350 

Glu Asn Phe Pro Phe Asn Asp 

365 

Glu Glu Gly Lys Met Thr Ala 
380 

Leu Gly Gly Ser Lys Val Arg 
395 400 
Gin lie Asp Pro Thr Pro Val 
410 415 
Ala Val lie Asp Gly Asn ser 

430 

Gin Asp Arg Met Phe Lys Phe 

445 
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Glu 


Leu 


Thr 


Lys Arg 


Leu 


Glu 




450 






455 


Glu 


val 


Lys 


Asp Phe 


Phe 


Arg 


465 






470 




Thr 


His 


Glu 


Phe Tyr val 


Arg 








485 




Pro 


Asn Asp 


Ala Asp lie 


Ser 








500 






Ala 


Gin 


Pro 


Ser Thr 


Ser 


Asp 






515 








Arg 


Tyr 


Gin 


Asn Lys 


Cys 


ser 


530 




Arg Gin 




535 


Phe 


pro 


Cys 


Cys 


Glu 


545 






550 




Phe 


Thr 


His 


Gly val 


Met 


Asp 








565 




Ser 


Gin 


Pro 


Val ser 


val 


Val 








580 






Pro 


He 


His 


His lie 


Met 


Gly 






595 




val 


cys 


Glu 


Leu 


Ala Asn 


Asp 


610 








615 



460 

Trp Ala ser Asp His val Thr Glu val 

475 480 
Lys Gly Gly Ala Arg Lys Arg Pro Ala 

490 495 
Glu Pro Lys Arg Ala cys Pro Ser Val 

505 510 
Ala Glu Ala Pro val Asp Tyr Ala Asp 
520 525 
Arg His Val Gly Met Asn Leu Met Leu 

540 

Arg Met Asn Gin Asn val Asp lie cys 

555 560 
cys Ala Glu Cys Phe Pro Val ser Glu 

570 575 
Arg Lys Arg Thr Tyr Gin Lys Leu Cys 

585 590 
Arg Ala Pro Glu Val Ala Cys Ser Ala 
600 605 
Leu Asp Asp cys Asp Met Glu Gin 

620 

<210> 3 
<211> 2495 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 3 , 

Ala Thr Gly Cys cys Gly Gly Gly Gly Thr Thr Cys Thr Ala Cys Gly 

15 10 15 

Ala Gly Ala Thr Cys Gly Thr Gly cys Thr Gly Ala Ala Gly Gly Thr 

20 25 30 

Gly cys cys Cys Ala Gly Cys Gly Ala cys Cys Thr Gly Gly Ala Cys 

35 40 45 

Met Pro Gly Phe Tyr Glu lie Val Leu Lys val Pro Ser Asp Leu Asp 

50 55 60 

Gly Ala Gly Cys Ala cys cys Thr Gly Cys Cys Cys Gly Gly cys Ala 
65 70 75 80 

Thr Thr Thr Cys Thr Gly Ala cys Thr Cys Thr Thr Thr Thr Gly Thr 

85 90 95 

Gly Ala Gly cys Thr Gly Gly Gly Thr Gly Gly Cys cys Gly Ala Gly 

100 105 110 

Glu His Leu Pro Gly lie Ser Asp Ser Phe val Ser Trp val Ala Glu 

115 120 125 

Ala Ala Gly Gly Ala Ala Thr Gly Gly Gly Ala Gly cys Thr Gly Cys 

130 135 140 

cys Gly Cys cys Gly Gly Ala Thr Thr cys Thr Gly Ala Cys Ala Thr 
145 150 155 160 

Gly Gly Ala cys Thr Thr Gly Ala Ala Thr cys Thr Gly Ala Thr Thr 

165 170 175 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Leu Asn Leu lie 

180 185 190 

Gly Ala Gly Cys Ala Gly Gly cys Ala cys cys Cys cys Thr Gly Ala 

195 200 205 

Cys cys Gly Thr Gly Gly cys cys Gly Ala Ala Ala Ala Gly cys Thr 

210 215 220 , , 

Gly cys Ala Ala cys Gly cys Gly Ala Gly Thr Thr cys Cys Thr Gly 
225 230 235 240 
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Glu Gin Ala Pro Leu Thr val Ala Glu Lys Leu fcin Arg Glu Phe Leu 

245 250 255 

Gly Thr Cys Gly Ala Gly Thr Gly Gly Cys Gly Cys Cys Gly cys Gly 

y 260 265 270 

Thr Gly Ala Gly Thr Ala Ala Gly Gly cys Cys cys cys Gly Gly Ala 

275 280 285 

Gly Gly cys cys cys Thr cys Thr Thr Cys Thr Thr Thr Gly Thr cys 

290 295 300 

Val Glu Trp Arg Arg val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
305 310 315 320 

Cys Ala Gly Thr Thr Cys Gly Ala Gly Ala Ala Gly Gly Gly Gly Gly 

325 330 335 

Ala Cys Ala Gly cys Thr Ala Cys Thr Thr Cys cys Ala cys Cys Thr 

340 345 350 

Gly cys Ala cys Ala Thr Cys cys Thr Gly Gly Thr Gly Gly Ala Gly 

355 360 365 

Gin Phe Glu Lys Gly Asp ser Tyr Phe His Leu His lie Leu val Glu 

370 375 380 

Ala cys cys Gly Thr Gly Gly Gly cys Gly Thr Cys Ala Ala Ala Thr 
385 390 395 400 

Cys cys Ala Thr Gly Gly Thr Gly Gly Thr Gly Gly Gly cys Cys Gly 

y 405 410 415 

Cys Thr Ala Cys Gly Thr Gly Ala Gly cys cys Ala Gly Ala Thr Thr 

420 425 430 

Thr val Gly val Lys ser Met val Val Gly Arg Tyr val ser Gin lie 

435 440 445 

Ala Ala Ala Gly Ala Gly Ala Ala Gly cys Thr Gly Gly Thr Gly Ala 

450 ' 455 460 

cys cys cys Gly cys Ala Thr cys Thr Ala Cys Cys Gly Cys Gly Gly 
465 470 475 , 480 

Gly Gly Thr Cys Gly Ala Gly Cys cys Gly Cys Ala Gly Cys Thr Thr 

485 490 495 

Lys Glu Lys Leu val Thr Arg lie Tyr Arg Gly val Glu Pro Gin Leu 

500 " 505 510 

Cys cys Gly Ala Ala Cys Thr Gly Gly Thr Thr Cys Gly Cys Gly Gly 

515 520 525 

Thr Gly Ala cys cys Ala Ala Gly Ala Cys Gly cys Gly Thr Ala Ala 

530 535 540 

Thr Gly Gly cys Gly cys Cys Gly Gly Ala Gly Gly Cys Gly Gly Gly 
545 550 ( 555 560 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

565 570 575 

Ala Ala cys Ala Ala Gly Gly Thr Gly Gly Thr Gly Gly Ala cys Gly 

580 585 590 

Ala cys Thr Gly Cys Thr Ala cys Ala Thr cys Cys Cys cys Ala Ala 

595 ' 600 605 

cys Thr Ala cys cys Thr Gly cys Thr cys cys cys cys Ala Ala Gly 

610 * 615 620 

Asn Lys val Val Asp Asp Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
625 630 635 640 

Ala cys cys cys Ala Gly cys cys cys Gly Ala Gly Cys Thr Cys Cys 

645 650 655 

Ala Gly Thr Gly Gly Gly Cys Gly Thr Gly Gly Ala cys Thr Ala Ala 

660 665 670 

Cys Ala Thr Gly Gly Ala Cys cys Ala Gly Thr Ala Thr Ala Thr Ala 

675 680 685 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Asp Gin Tyr lie 

690 695 700 

Ala Gly Cys Gly Cys cys Thr Gly Thr Thr Thr Gly Ala Ala Thr Cys 
705 710 715 720 

Thr cys Gly cys Gly Gly Ala Gly Cys Gly Thr Ala Ala Ala cys Gly 

725 730 735 

Gly cys Thr Gly Gly Thr Gly Gly Cys Gly Cys Ala Gly cys Ala Thr 
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740 745 750 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 

755 760 " 765 

Cys Thr Gly Ala cys Gly Cys Ala cys Gly Thr Gly Thr Cys Gly Cys 

770 775 780 

Ala Gly Ala cys Gly cys Ala Gly Gly Ala Gly cys Ala Gly Ala Ala 
785 790 795 800 

cys Ala Ala Gly Gly Ala Ala Ala Ala cys Cys Ala Gly Ala Ala cys 

805 810 815 

Leu Thr His val ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Gin Asn 

820 825 830 

cys Cys Cys Ala Ala Thr Thr cys Thr Gly Ala cys Gly Cys Gly cys 

835 840 845 

cys Gly Gly Thr cys Ala Thr cys Ala Gly Gly Thr cys Ala Ala Ala 

850 855 860 

Ala Ala cys Cys Thr Cys cys Gly cys cys Ala Gly Gly Thr Ala Cys 
865 870 875 880 

Pro Asn ser Asp Ala Pro Val lie Arg Ser Lys Thr ser Ala Arg Tyr 

885 890 895 

Ala Thr Gly Gly Ala Gly Cys Thr Gly Gly Thr Cys Gly Gly Gly Thr 

900 905 910 

Gly Gly cys Thr Gly Gly Thr Gly Gly Ala Cys cys Gly cys Gly Gly 

915 920 925 

Gly Ala Thr cys Ala Cys Gly Thr Cys Ala Gly Ala Ala Ala Ala Gly 

930 935 940 

Met Glu Leu val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 
945 950 955 960 

Cys Ala Ala Thr Gly Gly Ala Thr Cys Cys Ala Gly Gly Ala Gly Gly 

965 970- 975 

Ala cys Cys Ala Gly Gly cys Gly Thr cys cys Thr Ala Cys Ala Thr 

980 985 990 

Cys Thr cys cys Thr Thr Cys Ala Ala cys Gly cys Cys Gly cys cys 

995 1000 1005 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie Ser Phe Asn Ala Ala 

1010 1015 1020 

Thr Cys Cys Ala Ala Cys Thr Cys Gly cys Gly Gly Thr cys Ala Cys 
1025 1030 1035 1040 

Ala Ala Ala Thr cys Ala Ala Gly Gly Cys Cys Gly Cys Gly Cys Thr 

1045 1050 1055 

Gly Gly Ala cys Ala Ala Thr Gly Cys cys Thr cys cys Ala Ala Ala 

1060 1065 1070 

ser Asn ser Arg ser Gin lie Lys Ala Ala Leu Asp Asn Ala Ser Lys 

1075 " 1080 1085 

Ala Thr cys Ala Thr Gly Ala Gly cys cys Thr Gly Ala cys Ala Ala 

1090 1095 1100 

Ala Gly Ala Cys Gly Gly Cys Thr Cys cys Gly Gly Ala cys Thr Ala 
1105 1110 1115 1120 

Cys Cys Thr Gly Gly Thr Gly Gly Gly cys Cys Ala Gly Ala Ala cys 

1125 1130 1135 

lie Met ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gin Asn 

1140 1145 1150 

Cys cys Gly cys cys Gly Gly Ala Gly Gly Ala cys Ala Thr Thr Thr 

1155 1160 1165 

cys cys Ala Gly cys Ala Ala cys Cys Gly cys Ala Thr cys Thr Ala 

1170 1175 1180 t n 

Cys cys Gly Ala Ala Thr Cys Cys Thr Cys Gly Ala Gly Ala Thr Gly 
1185 1190 1195 1200 

Pro Pro Glu Asp lie Ser Ser Asn Arg lie Tyr Arg lie Leu Glu Met 

1205 1210 1215 

Ala Ala Cys Gly Gly Gly Thr Ala Cys Gly Ala Thr Cys Cys Gly Cys 

1220 1225 1230 

Ala Gly Thr Ala Cys Gly Cys Gly Gly Cys cys Thr Cys cys Gly Thr 
1235 1240 1245 
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Cvs Thr Thr cys Cys Thr Gly Gly Gly cys Thr Gly Gly Gly Cys Gly 

1250 1255 1260 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala ser val Phe Leu Gly Trp Ala 
1265 1270 1275 1280 

Cvs Ala Ala Ala Ala Gly Ala Ala Gly Thr Thr Cys Gly Gly Gly Ala 

7 1285 1290 1295 

Ala Gly Ala Gly Gly Ala Ala Cys Ala cys cys Ala Thr cys Thr Gly 

1300 1305 1310 

Gly cys Thr cys Thr Thr Thr Gly Gly Gly cys Cys Gly Gly Cys cys 

7 1315 1320 1325 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

1330 1335 1340 

Ala cys Gly Ala cys Gly Gly Gly Thr Ala Ala Ala Ala Cys Cys Ala 
1345 1350 1355 1360 

Ala Cys Ala Thr Cys Gly cys Gly Gly Ala Ala Gly cys cys Ala Thr 

1365 1370 1375 

cys Gly Cys cys cys Ala cys Gly cys cys Gly Thr Gly cys Cys Cys 

1380 1385 1390 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He Ala His Ala val pro 

1395 1400 1405 

Thr Thr Cys Thr Ala Cys Gly Gly Cys Thr Gly cys Gly Thr Gly Ala 

1410 1415 1420 

Ala cys Thr Gly Gly Ala cys Cys Ala Ala Thr Gly Ala Gly Ala Ala 
1425 1430 1435 n n 1440 

cys Thr Thr Thr cys Cys Gly Thr Thr cys Ala Ala Cys Gly Ala Thr 
7 1445 1450 1455 

Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

1460 1465 1470 

Thr Gly Cys Gly Thr cys Gly Ala cys Ala Ala Gly Ala Thr Gly Gly 

1475 1480 n 1485 

Thr Gly Ala Thr cys Thr Gly Gly Thr Gly Gly Gly Ala Gly Gly Ala 

1490 1495 n 1500 

Gly Gly Gly cys Ala Ala Gly Ala Thr Gly Ala cys Gly Gly Cys cys 
1505 1510 1515 1520 

cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 

1525 1530 1535 

Ala Ala Gly Gly Thr cys Gly Thr Ala Gly Ala Gly Ala Gly Cys Gly 

1540 1545 1550 

cys cys Ala Ala Gly Gly Cys Cys Ala Thr cys cys Thr Gly Gly Gly 

1555 1560 1565 

cys Gly Gly Ala Ala Gly cys Ala Ala Gly Gly Thr Gly cys Gly cys 

1570 1575 1580 

Lys val val Glu ser Ala Lys Ala lie Leu Gly Gly Ser Lys val Arg 
1585 1590 1595 1600 

Gly Thr Gly Gly Ala cys cys Ala Ala Ala Ala Gly Thr Gly cys Ala 

1605 1610 1615 

Ala Gly Thr Cys Ala Thr Cys Gly Gly cys Cys Cys Ala Gly Ala Thr 

1620 1625 1630 

cys Gly Ala Cys cys Cys Ala Ala cys Thr Cys cys cys Gly thr Gly 

1635 * 1640 1645 

val Asp Gin Lys cys Lys ser ser Ala Gin lie Asp Pro Thr Pro val 

1650 1655 1660 

Ala Thr cys Gly Thr cys Ala cys cys Thr Cys cys Ala Ala Cys Ala 
1665 1670 1675 1680 

cys cys Ala Ala cys Ala Thr Gly Thr Gly Cys Gly cys Gly Gly Thr 

77 1685 1690 1695 

Cys Ala Thr cys Gly Ala cys Gly Gly Ala Ala Ala Cys Thr Cys Gly 

1700 1705 1710 

He val Thr Ser Asn Thr Asn Met cys Ala val lie Asp Gly Asn ser 

1715 1720 1725 

Ala cys Cys Ala cys cys Thr Thr Cys Gly Ala Gly cys Ala cys cys 

1730 1735 1740 

Ala Ala cys Ala Ala cys cys Ala cys Thr Cys cys Ala Gly Gly Ala 
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1745 1750 1755 1760 

Cvs CVS Gly Gly Ala Thr Gly Thr Thr Cys Ala Ala Gly Thr Thr Cys 

y 7 1765 1770 1775 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

1780 1785 1790 

Gly Ala Gly Cys Thr Cys Ala Cys Cys Ala Ala Gly cys Gly Cys Cys 

y 1795 1800 1805 

Thr Gly Gly Ala Gly Cys Ala Cys Gly Ala Cys Thr Thr Thr Gly Gly 

1810 1815 1820 

cvs Ala Ala Gly Gly Thr cys Ala cys Cys Ala Ala Gly Cys Ala Gly 
1825 1830 1835 1840 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys Val Thr Lys Gin 

1845 1850 1855 

Gly Ala Ala Gly Thr Cys Ala Ala Ala Gly Ala Cys Thr Thr Thr Thr 

1860 1865 1870 

Thr cys cys Gly Gly Thr Gly Gly Gly cys Gly Thr Cys Ala Gly Ala 

1875 1880 1885 

Thr cys Ala cys Gly Thr Gly Ala cys cys Gly Ala Gly Gly Thr Gly 

1890 1895 1900 

Glu Val Lys Asp Phe Phe Arg Trp Ala ser Asp His Val Thr Glu Val 
1905 1910 1915 1920 

Ala cys Thr Cys Ala cys Gly Ala Gly Thr Thr Thr Thr Ala cys Gly 

1925 1930 1935 

Thr Cys Ala Gly Ala Ala Ala Gly Gly Gly Thr Gly Gly Ala Gly cys 

1940 1945 1950 

Thr Ala Gly Ala Ala Ala Gly Ala Gly Gly cys cys Cys Gly Cys Cys 

1955 1960 1965 

Thr His Glu Phe Tyr Val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

1970 1975 1980 

Cys Cys cys Ala Ala Thr Gly Ala Cys Gly cys Ala Gly Ala Thr Ala 
1985 1990 1995 2000 

Thr Ala Ala Gly Thr Gly Ala Gly cys cys cys Ala Ala Gly Cys Gly 

2005 2010 2015 

Gly Gly Cys Cys Thr Gly Thr Cys Cys Gly Thr cys Ala Gly Thr Thr 

2020 2025 2030 

Pro Asn Asp Ala Asp lie Ser Glu Pro Lys Arg Ala cys Pro ser Val 

2035 2040 2045 

Gly cys Gly Cys Ala Gly Cys Cys Ala Thr Cys Gly Ala Cys Gly Thr 

2050 2055 2060 

cys Ala Gly Ala Cys Gly Cys Gly Gly Ala Ala Gly Cys Thr cys cys 
2065 2070 2075 2080 

Gly Gly Thr Gly Gly Ala cys Thr Ala cys Gly cys Gly Gly Ala Cys 

2085 2090 2095 

Ala Gin Pro Ser Thr Ser Asp Ala Glu Ala Pro val Asp Tyr Ala Asp 

2100 2105 2110 

Ala Gly Gly Thr Ala Cys Cys Ala Ala Ala Ala cys Ala Ala Ala Thr 

2115 2120 2125 

Gly Thr Thr cys Thr cys Gly Thr cys Ala cys Gly Thr Gly Gly Gly 

2130 2135 2140 

Thr Ala Thr Gly Ala Ala Thr Cys Thr Gly Ala Thr Gly Cys Thr Thr 
2145 2150 2155 2160 

Arg Tyr Gin Asn Lys Cys Ser Arg His val Gly Met Asn Leu Met Leu 

2165 2170 2175 

Thr Thr Thr cys Cys cys Thr Gly cys cys Gly Gly cys Ala Ala Thr 

2180 2185 2190 

Gly Cys Gly Ala Gly Ala Gly Ala Ala Thr Gly Ala Ala Thr cys Ala 

2195 2200 2205 

Gly Ala Ala Thr Gly Thr Gly Gly Ala cys Ala Thr Thr Thr Gly Cys 

2210 2215 2220 

Phe Pro Cys Arg Gin cys Glu Arg Met Asn Gin Asn Val Asp lie Cys 
2225 2230 2235 2240 

Thr Thr Cys Ala cys Gly cys Ala cys Gly Gly Gly Gly Thr cys Ala 

2245 2250 2255 
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Thr Gly Gly Ala Cys Thr Gly Thr Gly Cys cys Gly Ala Gly Thr Gly 

2260 2265 2270 

cys Thr Thr Cys Cys Cys cys Gly Thr Gly Thr cys Ala Gly Ala Ala 

2275 * 2280 2285 

Phe Thr His Gly val Met Asp cys Ala Glu Cys Phe Pro val ser Glu 

2290 2295 2300 

Thr cys Thr cys Ala Ala cys cys cys Gly Thr Gly Thr cys Thr Gly 
2305 " 2310 2315 n ^ ^ 2320 

Thr cys Gly Thr cys Ala Gly Ala Ala Ala Gly cys Gly Gly Ala cys 

2325 2330 2335 

Gly Thr Ala Thr Cys Ala Gly Ala Ala Ala cys Thr Gly Thr Gly Thr 

2340 2345 2350 

ser Gin Pro Val Ser val val Arg Lys Arg Thr Tyr Gin Lys Leu Cys 

2355 2360 2365 

Cys cys Gly Ala Thr Thr Cys Ala Thr cys Ala Cys Ala Thr Cys Ala 

2370 2375 2380 

Thr Gly Gly Gly Gly Ala Gly Gly Gly cys Gly cys Cys cys Gly Ala 
2385 2390 2395 2400 

Gly Gly Thr Gly Gly Cys Cys Thr Gly cys Thr cys Gly Gly Cys cys 

2405 2410 2415 

pro lie His His He Met Gly Arg Ala Pro Glu val Ala cys ser Ala 

2420 2425 2430 

Thr Gly Cys Gly Ala Ala Cys Thr Gly Gly cys cys Ala Ala Thr Gly 

2435 2440 2445 

Thr Gly Gly Ala Cys Thr Thr Gly Gly Ala Thr Gly Ala Cys Thr Gly 

2450 2455 2460 

Thr Gly Ala cys Ala Thr Gly Gly Ala Ala cys Ala Ala Thr Ala Ala 
2465 2470 2475 2480 

cys Glu Leu Ala Asn val Asp Leu Asp Asp cys Asp Met Glu Gin 
y 2485 2490 2495 



<210> 4 
<211> 734 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 4 , „ . 

Met Thr Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu ser Glu 

15 10 15 

Gly Val Arg Glu Trp Trp Ala Leu Gin Pro Gly Ala Pro Lys Pro Lys 

20 25 30 

Ala Asn Gin Gin His Gin Asp Asn Ala Arg Gly Leu Val Leu Pro Gly 

35 40 45 

Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro val 

50 55 60 

Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gin 
65 70 75 . 80 

Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 

85 90 95 

Ala Glu Phe Gin Gin Arg Leu Gin Gly Asp Thr Ser Phe Gly Gly. Asn 

100 105 n 110 

Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg val Leu Glu Pro Leu 

115 120 125 

Gly Leu val Glu Gin Ala Gly Glu Thr Ala Pro Gly Lys Lys Arg Pro 

130 135 140 

Leu lie Glu Ser Pro Gin Gin Pro Asp ser Ser Thr Gly lie Gly Lys 
145 150 155 160 

Lys Gly Lys Gin Pro Ala Lys Lys Lys Leu val Phe Glu Asp Glu Thr 

165 170 175 
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Glv Ala Gly Asp Gly Pro Pro. Glu Gly Ser Thr Ser Gly Ala Met ser 

180 185 190 

asp Asp Ser Glu Met Arg Ala Ala Ala Gly Gly Ala Ala Val Glu Gly 

195 200 205 

Gly Gin Gly Ala Asp Gly val Gly Asn Ala Ser Gly Asp Trp His Cys 

210 215 220 

Asp ser Thr Trp Ser Glu Gly His Val Thr Thr Thr ser Thr Arg Thr 
225 230 235 n 240 

Tro val Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Arg Leu Gly Glu 

K 245 250 255 

Ser Leu Gin Ser Asn Thr Tyr Asn Gly Phe Ser Thr Pro Trp Gly Tyr 

260 265 270 

Phe Asp Phe Asn Arg Phe His Cys His Phe ser Pro Arg Asp Trp Gin 

275 280 285 

Arq Leu lie Asn Asn Asn Trp Gly Met Arg Pro Lys Ala Met Arg val 

290 295 300 

Lys lie Phe Asn lie Gin val Lys Glu val Thr Thr ser Asn Gly Glu 
305 310 315 320 

Thr Thr val Ala Asn Asn Leu Thr ser Thr val Gin lie Phe Ala Asp 

325 330 335 

ser ser Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu Gly ser 

340 345 350 

Leu Pro Pro Phe Pro Asn Asp Val Phe Met val Pro Gin Tyr Gly Tyr 

355 360 365 

cys Gly Leu val Thr Gly Asn Thr Ser Gin Gin Gin Thr Asp Arg Asn 

370 375 380 

Ala Phe Tyr Cys Leu Glu Tyr Phe Pro ser Gin Met Leu Arg Thr Gly 
385 390 395 . 400 

Asn Asn Phe Glu lie Thr Tyr Ser Phe Glu Lys Val Pro Phe His Ser 

405 410 415 

Met Tyr Ala His ser Gin ser Leu Asp Arg Leu Met Asn Pro Leu lie 

420 425 ~ 430 

Asp Gin Tyr Leu Trp Gly Leu Gin Ser Thr Thr Thr Gly Thr Thr Leu 

435 440 445 

Asn Ala Gly Thr Ala Thr Thr Asn Phe Thr Lys Leu Arg Pro Thr Asn 

450 455 460 

Phe Ser Asn Phe Lys Lys Asn Trp Leu Pro Gly Pro Ser lie Lys Gin 
465 470 475 480 

Gin Gly Phe ser Lys Thr Ala Asn Gin Asn Tyr Lys He Pro Ala Thr 

485 490 495 

Gly ser Asp ser Leu lie Lys Tyr Glu Thr His Ser Thr Leu Asp Gly 

500 505 . 510 

Arg Trp ser Ala Leu Thr Pro Gly Pro Pro Met Ala Thr Ala Gly Pro 

515 520 525 

Ala Asp Ser Lys Phe Ser Asn Ser Gin Leu lie Phe Ala Gly Pro Lys 

530 535 540 

Gin Asn Gly Asn Thr Ala Thr Val Pro Gly Thr Leu He Phe Thr ser 
545 550 555 560 

Glu Glu Glu Leu Ala Ala Thr Asn Ala Thr Asp Thr Asp Met Trp Gly 

565 570 575 

Asn Leu Pro Gly Gly Asp Gin Ser Asn ser Asn Leu Pro Thr val Asp 

580 585 590 

Arg Leu Thr Ala Leu Gly Ala val Pro Gly Met val Trp Gin Asn Arg 

595 600 605 . 

Asp lie Tyr Tyr Gin Gly Pro He Trp Ala Lys lie Pro His Thr Asp 

610 615 620 ^ 

Gly His Phe His Pro Ser Pro Leu lie Gly Gly Phe Gly Leu Lys His 
625 630 635 n 640 

Pro Pro Pro Gin lie Phe lie Lys Asn Thr Pro val Pro Ala Asn Pro 

645 650 655 

Ala Thr Thr Phe ser Ser Thr Pro val Asn ser Phe lie Thr Gin Tyr 

660 665 670 

ser Thr Gly Gin val Ser Val Gin He Asp Trp Glu lie Gin Lys Glu 
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675 680 685 

Ara ser Lys Arg Trp Asn Pro Glu val Gin Phe Thr ser Asn Tyr Gly 

690 695 700 

Gin Gin Asn Ser Leu Leu Trp Ala Pro Asp Ala Ala Gly Lys Tyr Thr 
705 710 715 720 

Glu Pro Arg Ala lie Gly Thr Arg Tyr Leu Thr His His Leu 

725 730 

<210> 5 
<211> 2208 . 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<221> misc_feature 
<222> (0)...(0) 
<223> n=a,t,c, or g 

<221> variation 
<222> (0)...(0) 
<223> xaa = any amino acid 

<400> 5 Crt 
atgactgacg gttaccttcc agattggcta gaggacaacc tctctgaagg cgttcgagag 60 

tggtgggcgc tgcaacctgg agcccctaaa cccaaggcaa atcaacaaca tcaggacaac 1/0 

gctcggggtc ttgtgcttcc gggttacaaa tacctcggac ccggcaacgg actcgacaag 180 

ggggaacccg tcaacgcagc ggacgcggca gccctcgagc acgacaaggc ctacgaccag 240 

cagctcaagg ccggtgacaa cccctacctc aagtacaacc acgccgacgc ggagttccag 300 

cagcggcttc agggcgacac atcgtttggg ggcaacctcg gcagagcagt cttccaggcc 360 

aaaaagaggg ttcttgaacc tcttggtctg gttgagcaag cgggtgagac ggctcctgga 420 

aagaagagac cgttgattga atccccccag cagcccgact cctccacggg tatcggcaaa 480 

aaaggcaagc agccggctaa aaagaagctc gttttcgaag acgaaactgg agcaggcgac 540 

ggaccccctg agggatcaac ttccggagcc atgtctgatg acagtgagat gcgtgcagca 600 

gctggcggag ctgcagtcga gggsggacaa ggtgccgatg gagtgggtaa tgcctcgggt 660 

gattggcatt gcgattccac ctggtctgag ggccacgtca cgaccaccag caccagaacc 720 

tgggtcttgc ccacctacaa caaccacctn tacaagcgac tcggagagag cctgcagtcc 780 
aacacctaca acggattctc caccccctgg ggatactttg acttcaaccg cttccactgc 
cacttctcac cacgtgactg gcagcgactc atcaacaaca actggggcat gcgacccaaa 
gccatgcggg tcaaaatctt caacatccag gtcaaggagg tcacgacgtc gaacggcgag 

acaacggtgg ctaataacct taccagcacg gttcagatct ttgcggactc gtcgtacgaa 1020 

ctgccgtacg tgatggatgc gggtcaagag ggcagcctgc ctccttttcc caacgacgtc 1080 

tttatggtgc cccagtacgg ctactgtgga ctggtgaccg gcaacacttc gcagcaacag 1140 

actgacagaa atgccttcta ctgcctggag tactttcctt cgcagatgct gcggactggc 1200 

aacaactttg aaattacgta cagttttgag aaggtgcctt tccactcgat gtacgcgcac 1260 

agccagagcc tggaccggct gatgaaccct ctcatcgacc agtacctgtg gggactgcaa 13 ?0 

tcgaccacca ccggaaccac cctgaatgcc gggactgcca ccaccaactt taccaagctg 1380 

cggcctacca acttttccaa ctttaaaaag aactggctgc ccgggccttc aatcaagcag 1440 

cagggcttct caaagactgc caatcaaaac tacaagatcc ctgccaccgg gtcagacagt 1500 

ctcatcaaat acgagacgca cagcactctg gacggaagat ggagtgccct gacccccgga 1560 

cctccaatgg ccacggctgg acctgcggac agcaagttca gcaacagcca gctcatcttt 1620 

gcggggccta aacagaacgg caacacggcc accgtacccg ggactctgat cttcacctct 1680 

gaggaggagc tggcagccac caacgccacc gatacggaca tgtggggcaa cctacctggc 1740 

ggtgaccaga gcaacagcaa cctgccgacc gtggacagac tgacagcctt gggagccgtg 1800 

cctggaatgg tctggcaaaa cagagacatt tactaccagg gtcccatttg ggccaagatt 1860 

cctcataccg atggacactt tcacccctca ccgctgattg gtgggtttgg gctgaaacac 1920 

ccgcctcctc aaatttttat caagaacacc ccggtacctg cgaatcctgc aacgaccttc 1980 

agctctactc cggtaaactc cttcattact cagtacagca ctggccaggt gtcggtgcag 2040 

attgactggg agatccagaa ggagcggtcc aaacgctgga accccgaggt ccagtttacc 2100 

tccaactacg gacagcaaaa ctctctgttg tgggctcccg atgcggctgg gaaatacact 2160 

gagcctaggg ctatcggtac ccgctacctc acccaccacc tgtaataa 2208 



840 
900 
960 
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<210> 6 
<211> 125 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 6 

ttggccactc cctctatgcg cgctcgctca ctcactcggc cctggagacc aaaggtctcc 



60 
20 
125 



agactgccgg cctctggccg gcagggccga gtgagtgagc gagcgcgcat agagggagtg 120 
gccaa 

<210> 7 
<211> 245 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 



60 



<400> 7 

ctccatcatc taggtttgcc cactgacgtc aatgtgacgt cctagggtta gggaggtccc 

tgtattagca gtcacgtgag tgtcgtattt cgcggagcgt agcggagcgc ataccaagct 120 

gccacgtcac agccacgtgg tccgtttgcg acagtttgcg acaccatgtg gtcaggaggg 180 

tatataaccg cgagtgagcc agcgaggagc tccattttgc ccgcgaattt tgaacgagca 240 

gcagc Z4 - > 

<210> 8 
<211> 313 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 8 , -| . 

Met Glu Leu val Gly Trp Leu Val Asp Arg Gly lie Thr Ser Glu Lys 

1 5 10 15 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie ser Phe Asn Ala Ala 

20 25 30 

ser Asn ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala ser Lys 

35 40 45 

lie Met ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gin Asn 

50 55 60 

Pro Pro Glu Asp lie Ser Ser Asn Arg lie Tyr Arg lie Leu Glu Met 
65 70 75 80 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 

85 90 95 

Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 

100 " 105 HO 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala val Pro 

115 120 t 125 

Phe Tyr Gly cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

130 135 140 

cys Val Asp Lys Met Val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys val val Glu ser Ala Lys Ala lie Leu Gly Gly ser Lys Val Arg 

165 170 175 
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val Asp Gin Lys cys Lys Ser ser Ala Gin lie Asp pro Thr Pro Val 

180 185 190 

lie Val Thr Ser Asn Thr Asn Met cys Ala val lie Asp Gly Asn ser 

195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

210 215 220 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys val Thr Lys Gin 
225 230 235 240 

Glu val Lys Asp Phe Phe Arg Trp Ala Ser Asp His val Thr Glu Val 

245 250 255 

Thr His Glu Phe Tyr val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

260 265 270 

pro Asn Asp Ala Asp He ser Glu Pro Lys Arg Ala Cys Pro ser Val 

275 280 n 285 

Ala Gin Pro ser Thr Ser Asp Ala Glu Ala Pro val Asp Tyr Ala Asp 

290 295 300 

Arg Leu Ala Arg Gly Gin Pro Leu Xaa 
305 310 

<210> 9 
<211> 399 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 9 H 

Met Glu Leu val Gly Trp Leu val Asp Arg Gly lie Thr ser Glu Lys 

1 5 10 15 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie Ser Phe Asn Ala Ala 

20 25 30 

ser Asn ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Ser Lys 

35 40 45 

He Met ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gin Asn 

50 55 60 

pro Pro Glu Asp lie Ser Ser Asn Arg lie Tyr Arg lie Leu Glu Met 
65 70 75 80 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser Val Phe Leu Gly Trp Ala 

85 90 95 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

100 105 n 110 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala Val Pro 

115 120 125 

Phe Tyr Gly Cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

130 135 140 

cys val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met Thr Ala 
145 150 155 160 

Lys Val val Glu Ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg 

165 u 175 

val Asp Gin Lys Cys Lys ser Ser Ala Gin lie Asp Pro Thr Pro Val 

180 185 190 

lie Val Thr Ser Asn Thr Asn Met Cys Ala val lie Asp Gly Asn Ser 

195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

210 215 220 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys val Thr Lys Gin 

225 230 235 240 

Glu val Lys Asp Phe Phe Arg Trp Ala ser Asp His val Thr Glu val 

245 250 255 

Thr His Glu Phe Tyr val Arg Lys Gly Gly Ala Arg Lys Arq Pro Ala 

260 " 265 270 
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Pro Asn Asp Ala Asp lie ser Glu Pro Lys Arg Ala cys pro ser val 

275 280 n 285 

Ala Gin Pro ser Thr Ser Asp Ala Glu Ala Pro Val Asp Tyr Ala Asp 

290 295 300 

Arq Tyr Gin Asn Lys Cys ser Arg His Val Gly Met Asn Leu Met Leu 

305 310 315 320 

Phe Pro Cys Arg Gin cys Glu Arg Met Asn Gin Asn Val Asp lie Cys 

325 " 330 n 335 

Phe Thr His Gly Val Met Asp Cys Ala Glu Cys Phe Pro Val Ser Glu 

340 " 345 350 

Ser Gin Pro val ser val val Arg Lys Arg Thr Tyr Gin Lys Leu cys 

355 360 365 

Pro He His His He Met Gly Arg Ala Pro Glu val Ala cys Ser Ala 

370 375 ~ 380 

Cys Glu Leu Ala Asn Val Asp Leu Asp Asp cys Asp Met Glu Gin 

385 390 395 

<210> 10 
<211> 537 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 10 

Met Pro Gly Phe Tyr Glu lie Val Leu Lys Val Pro ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly lie Ser Asp ser Phe Val Ser Trp Val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Leu Asn Leu lie 

35 40 45 

Glu Gin Ala Pro Leu Thr val Ala Glu Lys Leu Gin Arg Glu Phe Leu 

50 55 60 

Val Glu Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 ~ 70 75 80 

Gin Phe Glu Lys Gly Asp Ser Tyr Phe His Leu His lie Leu Val Glu 

85 90 95 

Thr Val Gly Val Lys Ser Met Val val Gly Arg Tyr val ser Gin lie 

100 105 110 

Lys Glu Lys Leu val Thr Arg lie Tyr Arg Gly val Glu Pro Gin Leu 

115 120 125 

Pro Asn Trp Phe Ala Val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys Val Val Asp Asp Cys Tyr lie pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Asp Gin Tyr He 

165 170 175 . 

Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu val Ala Gin His 

180 185 190 

Leu Thr His val ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Gin Asn 

195 200 205 

pro Asn Ser Asp Ala Pro val lie Arg Ser Lys Thr ser Ala Arg Tyr 

210 215 220 

Met Glu Leu val Gly Trp Leu val Asp Arg Gly lie Thr ser Glu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie Ser Phe Asn Ala Ala 

245 250 255 

Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala ser Lys 

260 265 270 

lie Met Ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu val Gly Gin Asn 
275 280 285 
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290 



295 



305 



310 



325 



340 



355 



360 



370 



375 



385 



390 



405 



420 



435 



440 



450 



455 



465 



470 



485 



500 



515 



520 



530 



535 



Arg 


lie 


Tyr 


Arg 


lie 


Leu Glu Met 




300 






Ala 


Ser 


val 
315 


Phe 


Leu 


Gly Trp Ala 
320 


Thr 


lie 


Trp 


Leu 


Phe 


Gly Pro Ala 




330 






335 


Glu 


Ala 


He 


Ala 


His 


Ala Val Pro 


345 










350 


Asn 


Glu 


Asn 


Phe 


Pro 


Phe Asn Asp 










365 


Trp 


Glu 


Glu 


Gly 


Lys 


Met Thr Ala 






380 




lie 


Leu 


Gly 


Gly 


Ser 


Lys val Arg 






395 




400 


Ala 


Gin 


lie 


Asp 


Pro 


Thr Pro Val 




410 






415 


Cys 


Ala 


val 


lie 


Asp 


Gly Asn ser 


425 










430 


Leu 


Gin 


Asp 


Arq 


Met 


Phe Lys Phe 






445 




Asp 


Phe 


Gly 


Lys 


Val 


Thr Lys Gin 




460 






Ala 


Ser 


Asp 
475 


His 


Val 


Thr Glu val 
480 


Gly 


Gly 


Ala 


Arg 


Lys 


Arg Pro Ala 


490 




495 


Pro 


Lys 


Arg 


Ala 


Cys 


Pro Ser val 


505 






510 


Glu 


Ala 


Pro 


val 


Asp 


Tyr Ala Asp 










525 


Xaa 













<210> 11 
<211> 623 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 11 

Met Pro Gly Phe Tyr Glu He Val Leu Lys Val Pro Ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly lie ser Asp ser Phe Val ser Trp val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu He 

35 40 45 

Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Glu Phe Leu 

50 55 60 

Val Glu Trp Arg Arg val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 70 75 80 

Gin Phe Glu Lys Gly Asp Ser Tyr Phe His Leu His lie Leu Val Glu 

* 85 90 95 

Thr val Gly Val Lys ser Met Val val Gly Arg Tyr Val Ser Gin He 

100 105 110 

Lys Glu Lys Leu val Thr Arg lie Tyr Arg Gly val Glu Pro Gin Leu 

115 120 125 

Pro Asn Trp Phe Ala val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys val val Asp Asp Cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 
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Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn wet Asp Gin Tyr lie 

165 170 175 

ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 

180 185 190 

Leu Thr His val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Gin Asn 

195 200 205 

Pro Asn ser Asp Ala Pro val lie Arg ser Lys Thr Ser Ala Arg Tyr 

210 215 220 

Met Glu Leu val Gly Trp Leu val Asp Arg Gly lie Thr ser Glu Lys 
225 230 235 n 240 

Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie ser Phe Asn Ala Ala 

245 250 255 

ser Asn ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Ser Lys 

260 265 270 

lie Met ser Leu Thr Lys Thr Ala Pro Asp Tyr Leu Val Gly Gin Asn 

275 280 285 

pro Pro Glu Asp lie ser Ser Asn Arg lie Tyr Arg lie Leu Glu Met 

290 295 " 300 ■ 

Asn Gly Tyr Asp Pro Gin Tyr Ala Ala Ser val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Lys Phe Gly Lys Arg Asn Thr He Trp Leu Phe Gly Pro Ala 

325 ~ 330 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie Ala His Ala Val Pro 

340 345 350 

Phe Tyr Gly cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

cys val Asp Lys Met val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 380 

Lys val val Glu ser Ala Lys Ala lie Leu Gly Gly Ser Lys val Arg 
385 390 395 400 

Val Asp Gin Lys cys Lys Ser ser Ala Gin lie Asp Pro Thr Pro Val 

405 ' 410 415 

lie val Thr Ser Asn Thr Asn Met cys Ala Val lie Asp Gly Asn Ser 

420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

Glu Leu Thr Lys Arg Leu Glu His Asp Phe Gly Lys val Thr Lys Gin 

450 455 460 

Glu val Lys Asp Phe Phe Arg Trp Ala Ser Asp His val Thr Glu val 
465 470 475 480 

Thr His Glu Phe Tyr val Arg Lys Gly Gly Ala Arg Lys Arg Pro Ala 

485 490 495 

pro Asn Asp Ala Asp He ser Glu Pro Lys Arg Ala Cys Pro Ser Val 

500 505 „ 510 

Ala Gin Pro ser Thr ser Asp Ala Glu Ala Pro val Asp Tyr Ala Asp 

515 520 525 

Arq Tyr Gin Asn Lys Cys ser Arg His val Gly Met Asn Leu Met Leu 

530 535 540 

Phe Pro Cys Arg Gin Cys Glu Arg Met Asn Gin Asn Val Asp lie Cys 
545 550 555 560 

Phe Thr His Gly val Met Asp cys Ala Glu cys Phe Pro Val Ser Glu 

565 570 575 

ser Gin Pro val Ser Val val Arg Lys Arg Thr Tyr Gin Lys Leu cys 

580 585 ^ 590 

Pro He His His lie Met Gly Arg Ala Pro Glu Val Ala Cys ser Ala 

595 600 605 

cys Glu Leu Ala Asn val Asp Leu Asp Asp Cys Asp Met Glu Gin 
610 615 620 

<210> 12 
<211> 939 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

atggagctgg tcgggtggct ggtggaccgc gggatcacgt cagaaaagca atggatccag 60 

gaggaccagg cgtcctacat ctccttcaac gccgcctcca actcgcggtc acaaatcaag 120 

gccgcgctgg acaatgcctc caaaatcatg agcctgacaa agacggctcc ggactacctg 180 

gtgggccaga acccgccgga ggacatttcc agcaaccgca tctaccgaat cctcgagatg 240 

aacgggtacg atccgcagta cgcggcctcc gtcttcctgg gctgggcgca aaagaagttc 300 

gggaagagga acaccatctg gctctttggg ccggccacga cgggtaaaac caacatcgcg 360 

gaagccatcg cccacgccgt gcccttctac ggctgcgtga actggaccaa tgagaacttt 420 

ccgttcaacg attgcgtcga caagatggtg atctggtggg aggagggcaa gatgacggcc 480 

aaggtcgtag agagcgccaa ggccatcctg ggcggaagca aggtgcgcgt ggaccaaaag 540 

tgcaagtcat cggcccagat cgacccaact cccgtgatcg tcacctccaa caccaacatg 600 

tgcgcggtca tcgacggaaa ctcgaccacc ttcgagcacc aacaaccact ccaggaccgg 660 

atgttcaagt tcgagctcac caagcgcctg gagcacgact ttggcaaggt caccaagcag 720 

gaagtcaaag actttttccg gtgggcgtca gatcacgtga ccgaggtgac tcacgagttt 780 

tacgtcagaa agggtggagc tagaaagagg cccgccccca atgacgcaga tataagtgag 840 

cccaagcggg cctgtccgtc agttgcgcag ccatcgacgt cagacgcgga agctccggtg 900 

gactacgcgg acagattggc tagaggacaa cctctctga 939 

<210> 13 
<211> 1197 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 13 „ «. cn 

atggagctgg tcgggtggct ggtggaccgc gggatcacgt cagaaaagca atggatccag bu 

gaggaccagg cgtcctacat ctccttcaac gccgcctcca actcgcggtc acaaatcaag 120 

gccgcgctgg acaatgcctc caaaatcatg agcctgacaa agacggctcc ggactacctg 180 

gtgggccaga acccgccgga ggacatttcc agcaaccgca tctaccgaat cctcgagatg 240 

aacgggtacg atccgcagta cgcggcctcc gtcttcctgg gctgggcgca aaagaagttc 300 

gggaagagga acaccatctg gctctttggg ccggccacga cgggtaaaac caacatcgcg 360 

gaagccatcg cccacgccgt gcccttctac ggctgcgtga actggaccaa tgagaacttt 420 

ccgttcaacg attgcgtcga caagatggtg atctggtggg aggagggcaa gatgacggcc 480 

aaggtcgtag agagcgccaa ggccatcctg ggcggaagca aggtgcgcgt ggaccaaaag 540 

tgcaagtcat cggcccagat cgacccaact cccgtgatcg tcacctccaa caccaacatg 600 

tgcgcggtca tcgacggaaa ctcgaccacc ttcgagcacc aacaaccact ccaggaccgg 660 

atgttcaagt tcgagctcac caagcgcctg gagcacgact ttggcaaggt caccaagcag 720 

gaagtcaaag actttttccg gtgggcgtca gatcacgtga ccgaggtgac tcacgagttt 780 

tacgtcagaa agggtggagc tagaaagagg cccgccccca atgacgcaga tataagtgag 840 

cccaagcggg cctgtccgtc agttgcgcag ccatcgacgt cagacgcgga agctccggtg 900 

gactacgcgg acaggtacca aaacaaatgt tctcgtcacg tgggtatgaa tctgatgctt 960 

tttccctgcc ggcaatgcga gagaatgaat cagaatgtgg acatttgctt cacgcacggg 1020 

gtcatggact gtgccgagtg cttccccgtg tcagaatctc aacccgtgtc tgtcgtcaga 1080 

aagcggacgt atcagaaact gtgtccgatt catcacatca tggggagggc gcccgaggtg 1140 

gcctgctcgg cctgcgaact ggccaatgtg gacttggatg actgtgacat ggaacaa 1197 

<210> 14 
<211> 1611 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 
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<400> 14 

atgccggggt tctacgagat cgtgctgaag gtgcccagcg acctggacga gcacctgccc 60 

ggcatttctg actcttttgt gagctgggtg gccgagaagg aatgggagct gccgccggat 120 

tctgacatgg acttgaatct gattgagcag gcacccctga ccgtggccga aaagctgcaa 180 

cgcgagttcc tggtcgagtg gcgccgcgtg agtaaggccc cggaggccct cttctttgtc 240 

cagttcgaga agggggacag ctacttccac ctgcacatcc tggtggagac cgtgggcgtc 300 

aaatccatgg tggtgggccg ctacgtgagc cagattaaag agaagctggt gacccgcatc 360 

taccgcgggg tcgagccgca gcttccgaac tggttcgcgg tgaccaagac gcgtaatggc 420 

gccggaggcg ggaacaaggt ggtggacgac tgctacatcc ccaactacct gctccccaag 480 

acccagcccg agctccagtg ggcgtggact aacatggacc agtatataag cgcctgtttg 540 

aatctcgcgg agcgtaaacg gctggtggcg cagcatctga cgcacgtgtc gcagacgcag 600 

gagcagaaca aggaaaacca gaaccccaat tctgacgcgc cggtcatcag gtcaaaaacc 660 

tccgccaggt acatggagct ggtcgggtgg ctggtggacc gcgggatcac gtcagaaaag 720 

caatggatcc aggaggacca ggcgtcctac atctccttca acgccgcctc caactcgcgg 780 

tcacaaatca aggccgcgct ggacaatgcc tccaaaatca tgagcctgac aaagacggct 840 

ccggactacc tggtgggcca gaacccgccg gaggacattt ccagcaaccg catctaccga 900 

atcctcgaga tgaacgggta cgatccgcag tacgcggcct ccgtcttcct gggctgggcg 960 

caaaagaagt tcgggaagag gaacaccatc tggctctttg ggccggccac gacgggtaaa 1020 

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt gaactggacc 1080 

aatgagaact ttccgttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc 1140 

aagatgacgg ccaaggtcgt agagagcgcc aaggccatcc tgggcggaag caaggtgcgc 1200 

gtggaccaaa agtgcaagtc atcggcccag atcgacccaa ctcccgtgat cgtcacctcc 1260 

aacaccaaca tgtgcgcggt catcgacgga aactcgacca ccttcgagca ccaacaacca 1320 

ctccaggacc ggatgttcaa gttcgagctc accaagcgcc tggagcacga ctttggcaag 1380 

gtcaccaagc aggaagtcaa agactttttc cggtgggcgt cagatcacgt gaccgaggtg 1440 

actcacgagt tttacgtcag aaagggtgga gctagaaaga ggcccgcccc caatgacgca 1500 

gatataagtg agcccaagcg ggcctgtccg tcagttgcgc agccatcgac gtcagacgcg 1560 

gaagctccgg tggactacgc ggacagattg gctagaggac aacctctctg a 1611 

<210> 15 
<211> 1872 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 15 

atgccggggt tctacgagat cgtgctgaag gtgcccagcg acctggacga gcacctgccc 60 

ggcatttctg actcttttgt gagctgggtg gccgagaagg aatgggagct gccgccggat 120 

tctgacatgg acttgaatct gattgagcag gcacccctga ccgtggccga aaagctgcaa 180 

cgcgagttcc tggtcgagtg gcgccgcgtg agtaaggccc cggaggccct cttctttgtc 240 

cagttcgaga agggggacag ctacttccac ctgcacatcc tggtggagac cgtgggcgtc 300 

aaatccatgg tggtgggccg ctacgtgagc cagattaaag agaagctggt gacccgcatc 360 

taccgcgggg tcgagccgca gcttccgaac tggttcgcgg tgaccaagac gcgtaatggc 420 

gccggaggcg ggaacaaggt ggtggacgac tgctacatcc ccaactacct gctccccaag 480 

acccagcccg agctccagtg ggcgtggact aacatggacc agtatataag cgcctgtttg 540 

aatctcgcgg agcgtaaacg gctggtggcg cagcatctga cgcacgtgtc gcagacgcag 600 

gagcagaaca aggaaaacca gaaccccaat tctgacgcgc cggtcatcag gtcaaaaacc 660 

tccgccaggt acatggagct ggtcgggtgg ctggtggacc gcgggatcac gtcagaaaag 720 

caatggatcc aggaggacca ggcgtcctac atctccttca acgccgcctc caactcgcgg 780 

tcacaaatca aggccgcgct ggacaatgcc tccaaaatca tgagcctgac aaagacggct 840 

ccggactacc tggtgggcca gaacccgccg gaggacattt ccagcaaccg catctaccga 900 

atcctcgaga tgaacgggta cgatccgcag tacgcggcct ccgtcttcct gggctgggcg 960 

caaaagaagt tcgggaagag gaacaccatc tggctctttg ggccggccac gacgggtaaa 1020 

accaacatcg cggaagccat cgcccacgcc gtgcccttct acggctgcgt gaactggacc 1080 

aatgagaact ttccgttcaa cgattgcgtc gacaagatgg tgatctggtg ggaggagggc 1140 

aagatgacgg ccaaggtcgt agagagcgcc aaggccatcc tgggcggaag caaggtgcgc 1200 

gtggaccaaa agtgcaagtc atcggcccag atcgacccaa ctcccgtgat cgtcacctcc 1260 

aacaccaaca tgtgcgcggt catcgacgga aactcgacca ccttcgagca ccaacaacca 1320 

ctccaggacc ggatgttcaa gttcgagctc accaagcgcc tggagcacga ctttggcaag 1380 

gtcaccaagc aggaagtcaa agactttttc cggtgggcgt cagatcacgt gaccgaggtg 1440 
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actcacgagt tttacgtcag aaagggtgga gcxagaaaga ggcccgcccc caatgacgca 1500 

gatataagtg agcccaagcg ggcctgtccg tcagttgcgc agccatcgac gtcagacgcg 1560 

gaagctccgg tggactacgc ggacaggtac caaaacaaat gttctcgtca cgtgggtatg 1620 

aatctgatgc tttttccctg ccggcaatgc gagagaatga atcagaatgt ggacatttgc 1680 

ttcacgcacg gggtcatgga ctgtgccgag tgcttccccg tgtcagaatc tcaacccgtg 1740 

tctgtcgtca gaaagcggac gtatcagaaa ctgtgtccga ttcatcacat catggggagg 1800 

gcgcccgagg tggcctgctc ggcctgcgaa ctggccaatg tggacttgga tgactgtgac 1860 

atggaacaat aa 1872 

<210> 16 
<211> 598 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 



15 

Gin Pro Ala Lys Lys 
30 

Asp Gly Pro Pro Glu 
45 

Glu Met Arg Ala Ala 
60 

Ala Asp Gly Val Gly 

80 

Trp Ser Glu Gly His 

95 

Pro Thr Tyr Asn Asn 
110 

ser Asn Thr Tyr Asn 
125 

Asn Arg Phe His cys 
140 

Asn Asn Asn Trp Gly 

160 

Asn lie Gin val Lys 

175 

Ala Asn Asn Leu Thr 
190 

Glu Leu Pro Tyr Val 
205 

Phe Pro Asn Asp Val 
220 

Val Thr Gly Asn Thr 

240 

Cys Leu Glu Tyr Phe 

255 

Glu lie Thr Tyr Ser 
270 

His Ser Gin Ser Leu 
285 

Leu Trp Gly Leu Gin 
300 

Thr Ala Thr Thr Asn 

320 

Phe Lys Lys Asn Trp 

335 

Ser Lys Thr Ala Asn 
350 



<400> 16 
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Gin Asn Tyr Lys lie Pro Ala Thr Gly Ser Asp ser Leu lie Lys Tyr 

355 360 365 

Glu Thr His ser Thr Leu Asp Gly Arg Trp Ser Ala Leu Thr Pro Gly 

370 375 380 

Pro Pro Met Ala Thr Ala Gly Pro Ala Asp ser Lys Phe ser Asn Ser 
385 390 395 400 

Gin Leu lie Phe Ala Gly Pro Lys Gin Asn Gly Asn Thr Ala Thr val 

405 410 415 

pro Gly Thr Leu lie Phe Thr ser Glu Glu Glu Leu Ala Ala Thr Asn 

420 425 430 

Ala Thr Asp Thr Asp Met Trp Gly Asn Leu Pro Gly Gly Asp Gin ser 

435 440 445 

Asn ser Asn Leu Pro Thr Val Asp Arg Leu Thr Ala Leu Gly Ala Val 

450 455 460 

pro Gly Met val Trp Gin Asn Arg Asp lie Tyr Tyr Gin Gly Pro lie 
465 470 475 480 

Trp Ala Lys lie Pro His Thr Asp Gly His Phe His Pro Ser Pro Leu 

485 490 495 

He Gly Gly Phe Gly Leu Lys His Pro Pro Pro Gin lie Phe lie Lys 

500 505 510 

Asn Thr Pro val Pro Ala Asn Pro Ala Thr Thr Phe Ser Ser Thr Pro 

515 520 525 

val Asn Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin val Ser val Gin 

530 535 540 

He Asp Trp Glu He Gin Lys Glu Arg Ser Lys Arg Trp Asn Pro Glu 
545 550 555 560 

val Gin Phe Thr Ser Asn Tyr Gly Gin Gin Asn Ser Leu Leu Trp Ala 

565 570 575 

pro Asp Ala Ala Gly Lys Tyr Thr Glu Pro Arg Ala lie Gly Thr Arg 

580 585 590 

Tyr Leu Thr His His Leu 
595 

<210> 17 
<211> 1800 
<212> DNA 

<213> Artificial Sequence 

<220> . „ 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<221> miscfeature 
<222> (0)...(0) 
<223> n=a,t f c, or g 

<221> variation 
<222> (0)... CO) 
<223> Xaa = any amino acid 

<400> 17 fc ^ 

acggctcctg gaaagaagag accgttgatt gaatcccccc agcagcccga ctcctccacg 

ggtatcggca aaaaaggcaa gcagccggct aaaaagaagc tcgttttcga agacgaaact 120 

ggagcaggcg acggaccccc tgagggatca acttccggag ccatgtctga tgacagtgag 180 

atgcgtgcag cagctggcgg agctgcagtc gagggsggac aaggtgccga tggagtgggt 240 

aatgcctcgg gtgattggca ttgcgattcc acctggtctg agggccacgt cacgaccacc 300 

agcaccagaa cctgggtctt gcccacctac aacaaccacc tntacaagcg actcggagag 360 

agcctgcagt ccaacaccta caacggattc tccaccccct ggggatactt tgacttcaac 420 

cgcttccact gccacttctc accacgtgac tggcagcgac tcatcaacaa caactggggc 480 

atgcgaccca aagccatgcg ggtcaaaatc ttcaacatcc aggtcaagga ggtcacgacg 540 

tcgaacggcg agacaacggt ggctaataac cttaccagca cggttcagat ctttgcggac 600 

tcgtcgtacg aactgccgta cgtgatggat gcgggtcaag agggcagcct gcctcctttt bbu 

cccaacgacg tctttatggt gccccagtac ggctactgtg qactggtgac cggcaacact /^u 



60 
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tcgcagcaac agactgacag aaatgccttc tactgcctgg agtactttcc ttcgcagatg 780 

ctgcggactg gcaacaactt tgaaattacg tacagttttg agaaggtgcc tttccactcg 840 

atgtacgcgc acagccagag cctggaccgg ctgatgaacc ctctcatcga ccagtacctg 900 

tggggactgc aatcgaccac caccggaacc accctgaatg ccgggactgc caccaccaac 960 

tttaccaagc tgcggcctac caacttttcc aactttaaaa agaactggct gcccgggcct 1020 

tcaatcaagc agcagggctt ctcaaagact gccaatcaaa actacaagat ccctgccacc 1080 

gggtcagaca gtctcatcaa atacgagacg cacagcactc tggacggaag atggagtgcc 1140 

ctgacccccg gacctccaat ggccacggct ggacctgcgg acagcaagtt cagcaacagc 1200 

cagctcatct ttgcggggcc taaacagaac ggcaacacgg ccaccgtacc cgggactctg 1260 

atcttcacct ctgaggagga gctggcagcc accaacgcca ccgatacgga catgtggggc 1320 

aacctacctg gcggtgacca gagcaacagc aacctgccga ccgtggacag actgacagcc 1380 

ttgggagccg tgcctggaat ggtctggcaa aacagagaca tttactacca gggtcccatt 1440 

tgggccaaga ttcctcatac cgatggacac tttcacccct caccgctgat tggtgggttt 1500 

gggctgaaac acccgcctcc tcaaattttt atcaagaaca ccccggtacc tgcgaatcct 1560 

gcaacgacct tcagctctac tccggtaaac tccttcatta ctcagtacag cactggccag 1620 

gtgtcggtgc agattgactg ggagatccag aaggagcggt ccaaacgctg gaaccccgag 1680 

gtccagttta cctccaacta cggacagcaa aactctctgt tgtgggctcc cgatgcggct 1740 

gggaaataca ctgagcctag ggctatcggt acccgctacc tcacccacca cctgtaataa 1800 

<210> 18 
<211> 544 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 18 n , . 

Met Ser Asp Asp Ser Glu Met Arg Ala Ala Ala Gly Gly Ala Ala Val 

15 10 15 

Glu Gly Gly Gin Gly Ala Asp Gly val Gly Asn Ala ser Gly Asp Trp 

20 25 30 

His cys Asp ser Thr Trp Ser Glu Gly His Val Thr Thr Thr ser Thr 

35 40 45 

Arg Thr Trp val Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Arg Leu 

50 55 60 

Gly Glu Ser Leu Gin Ser Asn Thr Tyr Asn Gly Phe ser Thr Pro Trp 
65 70 75 80 

Gly Tyr Phe Asp Phe Asn Arg Phe His cys His Phe ser Pro Arg Asp 

y 85 90 95 

Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Met Arg Pro Lys Ala Met 

y 100 105 n 110 

Arg val Lys lie Phe Asn He Gin val Lys Glu val Thr Thr Ser Asn 

115 120 125 

Gly Glu Thr Thr Val Ala Asn Asn Leu Thr Ser Thr Val Gin lie Phe 

130 135 140 

Ala Asp Ser Ser Tyr Glu Leu Pro Tyr val Met Asp Ala Gly Gin Glu 
145 150 155 160 

Gly ser Leu Pro Pro Phe Pro Asn Asp Val Phe Met Val Pro Gin Tyr 

165 170 175 

Gly Tyr Cys Gly Leu Val Thr Gly Asn Thr ser Gin Gin Gin Thr Asp 

180 185 190 

Arg Asn Ala Phe Tyr Cys Leu Glu Tyr Phe Pro ser Gin Met Leu Arg 

195 200 205 

Thr Gly Asn Asn Phe Glu He Thr Tyr Ser Phe Glu Lys val Pro Phe 

210 215 220 

His Ser Met Tyr Ala His Ser Gin ser Leu Asp Arg Leu Met Asn Pro 
225 230 235 240 

Leu lie Asp Gin Tyr Leu Trp Gly Leu Gin Ser Thr Thr Thr Gly Thr 

245 250 255 

Thr Leu Asn Ala Gly Thr Ala Thr Thr Asn Phe Thr Lys Leu Arg Pro 

260 265 270 
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i Ml Mill ri ic 3ci rwii ■ ^ ■ " r - — 

275 280 285 

Lvs Gin Gin Gly Phe Ser Lys Thr Ala Asn Gin Asn Tyr Lys lie Pro 

290 295 300 

Ala Thr Gly Ser Asp Ser Leu lie Lys Tyr Glu Thr His Ser Thr Leu 
305 310 315 320 

Asp Gly Arg Trp Ser Ala Leu Thr Pro Gly Pro Pro Met Ala Thr Ala 

Gly Pro Ala Asp Ser Lys Phe Ser Asn Ser Gin Leu lie Phe Ala Gly 

340 345 350 

pro Lys Gin Asn Gly Asn Thr Ala Thr val Pro Gly Thr Leu lie Phe 

355 360 365 

Thr Ser Glu Glu Glu Leu Ala Ala Thr Asn Ala Thr Asp Thr Asp Met 

370 375 380 

Tro Gly Asn Leu Pro Gly Gly Asp Gin Ser Asn Ser Asn Leu Pro Thr 
385 390 395 400 

Val Asp Arg Leu Thr Ala Leu Gly Ala Val Pro Gly Met val Trp Gin 

405 ' 410 415 

Asn Arg Asp He Tyr Tyr Gin Gly Pro lie Trp Ala Lys lie Pro His 

420 425 430 

Thr Asp Gly His Phe His Pro ser Pro Leu lie Gly Gly Phe Gly Leu 

435 440 445 

lvs His Pro Pro Pro Gin lie Phe lie Lys Asn Thr Pro val Pro Ala 

450 455 460 

Asn Pro Ala Thr Thr Phe Ser Ser Thr Pro val Asn Ser Phe He Thr 
465 470 475 480 

Gin Tyr Ser Thr Gly Gin Val ser Val Gin lie Asp Trp Glu lie Gin 

485 490 495 

Lys Glu Arg ser Lys Arg Trp Asn Pro Glu val Gin Phe Thr Ser Asn 

500 505 510 

Tyr Gly Gin Gin Asn ser Leu Leu Trp Ala Pro Asp Ala Ala Gly Lys 

515 520 525 

Tyr Thr Glu Pro Arg Ala lie Gly Thr Arg Tyr Leu Thr His His Leu 
530 535 540 

<210> 19 
<211> 1617 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<221> misc_feature 
<222> (0) . - . CO) 
<223> n=a,t,c, or g 

<221> variation 
<222> (0)...(0) 
<223> Xaa = any amino acid 

<400> 19 _ rn 

atgcgtgcag cagctggcgg agctgcagtc gagggsggac aaggtgccga tggagtgggt bu 

aatgcctcgg gtgattggca ttgcgattcc acctggtctg agggccacgt cacgaccacc 120 

agcaccagaa cctgggtctt gcccacctac aacaaccacc tntacaagcg actcggagag 180 

agcctgcagt ccaacaccta caacggattc tccaccccct ggggatactt tgacttcaac 240 

cgcttccact gccacttctc accacgtgac tggcagcgac tcatcaacaa caactggggc iuu 

atgcgaccca aagccatgcg ggtcaaaatc ttcaacatcc aggtcaagga ggtcacgacg ibu 

tcgaacggcg agacaacggt ggctaataac cttaccagca cggttcagat ctttgcggac 4£U 

tcgtcgtacg aactgccgta cgtgatggat gcgggtcaag agggcagcct gcctcctttt 4»u 

cccaacgacg tctttatggt gccccagtac ggctactgtg gactggtgac cggcaacact 540 

tcgcagcaac agactgacag aaatgccttc tactgcctgg agtactttcc ttcgcagatg 600 
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660 



900 
960 
020 
1080 



ctgcggactg gcaacaactt tgaaattacg tacagrcttg agaaggxgcc tttccactcg 

atgtacgcgc acagccagag cctggaccgg ctgatgaacc ctctcatcga ccagtacctg 720 

tqgggactgc aatcgaccac caccggaacc accctgaatg ccgggactgc caccaccaac 780 

tttaccaagc tgcggcctac caacttttcc aactttaaaa agaactggct gcccgggcct 840 

tcaatcaagc agcagggctt ctcaaagact gccaatcaaa actacaagat ccctgccacc 

gggtcagaca gtctcatcaa atacgagacg cacagcactc tggacggaag atggagtgcc 

ctgacccccg gacctccaat ggccacggct ggacctgcgg acagcaagtt cagcaacagc 1020 

cagctcatct ttgcggggcc taaacagaac ggcaacacgg ccaccgtacc cgggactctg 1080 

atcttcacct ctgaggagga gctggcagcc accaacgcca ccgatacgga catgtggggc 1140 

aacctacctg gcggtgacca gagcaacagc aacctgccga ccgtggacag actgacagcc 1200 

ttgggagccg tgcctggaat ggtctggcaa aacagagaca tttactacca gggtcccatt 1260 

tgggccaaga ttcctcatac cgatggacac tttcacccct caccgctgat tggtgggttt 1320 

qqgctgaaac acccgcctcc tcaaattttt atcaagaaca ccccggtacc tgcgaatcct 1380 

gcaacgacct tcagctctac tccggtaaac tccttcatta ctcagtacag cactggccag 1440 

gtgtcggtgc agattgactg ggagatccag aaggagcggt ccaaacgctg gaaccccgag 1500 

gtccagttta cctccaacta cggacagcaa aactctctgt tgtgggctcc cgatgcggct 1560 

gggaaataca ctgagcctag ggctatcggt acccgctacc tcacccacca cctgtaa 1617 

<210> 20 
<211> 129 
<212> DNA 

<213> Artificial Sequence 

<220> . , 

<223> Description of Artificial Sequence; note = 
synthetic construct 



60 
20 
129 



<400> 20 

ttggccactc cctctatgcg cgctcgctca ctcactcggc cctgcggcca gaggccggca 
gtctggagac ctttggtgtc cagggcaggg ccgagtgagt gagcgagcgc gcatagaggg 120 
agtggccaa 

<210> 21 
<211> 35 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 21 

tctagtctag acttggccac tccctctctg cgcgc " 

<210> 22 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 22 -., 
aggccttaag agcagtcgtc caccaccttg ttcc ^ 

<210> 23 
<211> 4652 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
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synthetic construct 

<400> 23 , A 

tggcactctc ccccctgtcg cgttcgctcg ctcgctggct cgtttggggg ggtggcagct 60 

caaagagctg ccagacgacg gccctctggc cgtcgccccc ccaaacgagc cagcgagcga 120 

gcgaacgcga caggggggag agtgccacac tctcaagcaa gggggttttg taagcagtga 180 

tgtcataatg atgtaatgct tattgtcacg cgatagttaa tgattaacag tcatgtgatg 240 

tqttttatcc aataggaaga aagcgcgcgt atgagttctc gcgagacttc cggggtataa 300 

aagaccgagt gaacgagccc gccgccattc tttgctctgg actgctagag gaccctcgct 360 

gccatggcta ccttctatga agtcattgtt cgcgtcccat ttgacgtgga ggaacatctg 420 

cctggaattt ctgacagctt tgtggactgg gtaactggtc aaatttggga gctgcctcca 480 

gagtcagatt taaatttgac tctggttgaa cagcctcagt tgacggtggc tgatagaatt 540 

cgccgcgtgt tcctgtacga gtggaacaaa ttttccaagc aggagtccaa attctttgtg 600 

cagtttgaaa agggatctga atattttcat ctgcacacgc ttgtggagac ctccggcatc 660 

tcttccatgg tcctcggccg ctacgtgagt cagattcgcg cccagctggt gaaagtggtc 720 

ttccagggaa ttgaacccca gatcaacgac tgggtcgcca tcaccaaggt aaagaagggc 780 

ggagccaata aggtggtgga ttctgggtat attcccgcct acctgctgcc gaaggtccaa 840 

ccggagcttc agtgggcgtg gacaaacctg gacgagtata aattggccgc cctgaatctg 900 

gaggagcgca aacggctcgt cgcgcagttt ctggcagaat cctcgcagcg ctcgcaggag 960 

gcggcttcgc agcgtgagtt ctcggctgac ccggtcatca aaagcaagac ttcccagaaa 1020 

tacatggcgc tcgtcaactg gctcgtggag cacggcatca cttccgagaa gcagtggatc 1080 

caggaaaatc aggagagcta cctctccttc aactccaccg gcaactctcg gagccagatc 1140 

aaggccgcgc tcgacaacgc gaccaaaatt atgagtctga caaaaagcgc ggtggactac 1200 

ctcgtgggga gctccgttcc cgaggacatt tcaaaaaaca gaatctggca aatttttgag 1260 

atgaatggct acgacccggc ctacgcggga tccatcctct acggctggtg tcagcgctcc 1320 

ttcaacaaga ggaacaccgt ctggctctac ggacccgcca cgaccggcaa gaccaacatc 1380 

gcggaggcca tcgcccacac tgtgcccttt tacggctgcg tgaactggac caatgaaaac 1440 

tttcccttta atgactgtgt ggacaaaatg ctcatttggt gggaggaggg aaagatgacc 1500 

aacaaggtgg ttgaatccgc caaggccatc ctggggggct caaaggtgcg ggtcgatcag 1560 

aaatgtaaat cctctgttca aattgattct acccctgtca ttgtaacttc caatacaaac 1620 

atgtgtgtgg tggtggatgg gaattccacg acctttgaac accagcagcc gctggaggac 1680 

cgcatgttca aatttgaact gactaagcgg ctcccgccag attttggcaa gattactaag 1740 

caggaagtca aggacttttt tgcttgggca aaggtcaatc aggtgccggt gactcacgag 1800 

tttaaagttc ccagggaatt ggcgggaact aaaggggcgg agaaatctct aaaacgccca 1860 

ctgggtgacg tcaccaatac tagctataaa agtctggaga agcgggccag gctctcattt 1920 

gttcccgaga cgcctcgcag ttcagacgtg actgttgatc ccgctcctct gcgaccgctc 1980 

aattggaatt caaggtatga ttgcaaatgt gactatcatg ctcaatttga caacatttct 2040 

aacaaatgtg atgaatgtga atatttgaat cggggcaaaa atggatgtat ctgtcacaat 2100 

gtaactcact gtcaaatttg tcatgggatt cccccctggg aaaaggaaaa cttgtcagat 2160 

tttggggatt ttgacgatgc caataaagaa cagtaaataa agcgagtagt catgtctttt 2220 

gttgatcacc ctccagattg gttggaagaa gttggtgaag gtcttcgcga gtttttgggc 2280 

cttgaagcgg gcccaccgaa accaaaaccc aatcagcagc atcaagatca agcccgtggt 2340 

cttgtgctgc ctggttataa ctatctcgga cccggaaacg gtctcgatcg aggagagcct 2400 

gtcaacaggg cagacgaggt cgcgcgagag cacgacatct cgtacaacga gcagcttgag 2460 

gcgggagaca acccctacct caagtacaac cacgcggacg ccgagtttca ggagaagctc 2520 

gccgacgaca catccttcgg gggaaacctc ggaaaggcag tctttcaggc caagaaaagg 2580 

gttctcgaac cttttggcct ggttgaagag ggtgctaaga cggcccctac cggaaagcgg 2640 

atagacgacc actttccaaa aagaaagaag gctcggaccg aagaggactc caagccttcc 2700 

acctcgtcag acgccgaagc tggacccagc ggatcccagc agctgcaaat cccagcccaa 2760 

ccagcctcaa gtttgggagc tgatacaatg tctgcgggag gtggcggccc attgggcgac 2820 

aataaccaag gtgccgatgg agtgggcaat gcctcgggag attggcattg cgattccacg 2880 

tggatggggg acagagtcgt caccaagtcc acccgaacct gggtgctgcc cagctacaac 2940 

aaccaccagt accgagagat caaaagcggc tccgtcgacg gaagcaacgc caacgcctac 3000 

tttggataca gcaccccctg ggggtacttt gactttaacc gcttccacag ccactggagc 3060 

ccccgagact ggcaaagact catcaacaac tactggggct tcagaccccg gtccctcaga 3120 

gtcaaaatct tcaacattca agtcaaagag gtcacggtgc aggactccac caccaccatc 3180 

gccaacaacc tcacctccac cgtccaagtg tttacggacg acgactacca gctgccctac 3240 

gtcgtcggca acgggaccga gggatgcctg ccggccttcc ctccgcaggt ctttacgctg 3300 

ccgcagtacg gttacgcgac gctgaaccgc gacaacacag aaaatcccac cgagaggagc 336U 

agcttcttct gcctagagta ctttcccagc aagatgctga gaacgggcaa caactttgag 3420 

tttacctaca actttgagga ggtgcccttc cactccagct tcgctcccag tcagaacctg 3480 

ttcaagctgg ccaacccgct ggtggaccag tacttgtacc gcttcgtgag cacaaataac 3540 

actggcggag tccagttcaa caagaacctg gccgggagat acgccaacac ctacaaaaac 3600 
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<400> 24 










val 


Glu 


His Gly 


He 


Met 


Ala 


Leu 


Val 


Asn 


Trp 


Leu 


1 








5 








10 




Gin 


Trp 


He 


Gin 


Glu 


Asn 


Gin 


Glu 


Ser 


Tyr Leu 


Ser 






20 










25 


Ala Leu 




Gly 


Asn 


ser 


Arg 


Ser 


Gin 


He 


Lys 


Ala 


Asp 




35 








40 


val 






He 


Met 


Ser 


Leu 


Thr 


Lys 


ser 


Ala 


Asp Tyr 


Leu 




50 








55 






lie Trp 


60 


val 


Pro 


Glu 


ASP 


lie 


Ser 


Lys 


Asn 


Arg 


Gin 


65 








70 




Gly 


75 




Asn 


Gly 


Tyr 


ASp 


Pro 


Ala 


Tyr 


Ala 


ser lie 


Leu 




85 








Thr 


90 




Gin 


Arg 


Ser 


Phe 


Asn 


Lys 


Arg 


Asn 


val Trp 


Leu 






100 






105 




Ala 


Thr 


Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala lie 






115 








120 






Phe 


Phe 


Tyr 


Gly 


cys 


Val 


Asn 


Trp 


Thr 


Asn 


Glu Asn 




130 






135 






Glu Glu 


140 


Cys 


Val 


Asp 


Lys 


Met 


Leu 


He 


Trp 


Trp 


Gly 


145 






150 






He 


155 


Gly 


Lys 


Val 


Val 


Glu 


ser 


Ala 


Lys 


Ala 


Leu Gly 








165 






Val 


170 




Val 


Asp 


Gin 


Lys 


cys 


Lys 


Ser 


Ser 


Gin He 


Asp 






180 






185 


val Val 


val 


lie 


val 


Thr 


ser 


Asn 


Thr 


Asn 


Met 


cys 






195 










200 


Glu Asp 




Thr 


Thr 


Phe 


Glu 


His 


Gin 


Gin 


Pro 


Leu 


Arg 




210 










215 






Phe Gly 


220 


Glu 


Leu 


Thr 


Lys 


Arg 


Leu 


Pro 


Pro 


Asp 


Lys 


225 






230 








235 




Glu 


val 


Lys 


Asp 


Phe 


Phe 


Ala 


Trp 


Ala 


Lys val 


Asn 






245 








250 


Gly 


Thr 


His 


Glu 


Phe 


Lys 


Val 


Pro 


Arg 


Glu 


Leu Ala 








260 






265 







15 

phe Asn Se 
30 

Asn Ala Th 
45 



80 

Tyr Gly Trp cy: 
95 

Tyr Gly Pro Ali 
110 

His Thr val Pn 
125 



160 

Ser Lys val Arg 
175 

Ser Thr Pro Val 
190 

Asp Gly Asn Ser 
205 



* 240 
Val Pro val 

255 
Lys Gly Ala 
270 



tggttcccgg ggcccatggg ccgaacccag ggctggaacc tgggctccgg ggtcaaccgc 3660 

gccagtgtca gcgccttcgc cacgaccaat aggatggagc tcgagggcgc gagttaccag 3720 

gtgcccccgc agccgaacgg catgaccaac aacctccagg gcagcaacac ctatgccctg 3780 

gagaacacta tgatcttcaa cagccagccg gcgaacccgg gcaccaccgc cacgtacctc 3840 

gagggcaaca tgctcatcac cagcgagagc gagacgcagc cggtgaaccg cgtggcgtac 3900 

aacgtcggcg ggcagatggc caccaacaac cagagctcca ccactgcccc cgcgaccggc 3960 

acgtacaacc tccaggaaat cgtgcccggc agcgtgtgga tggagaggga cgtgtacctc 4020 

caaggaccca tctgggccaa gatcccagag acgggggcgc actttcaccc ctctccggcc 4080 

atgggcggat tcggactcaa acacccaccg cccatgatgc tcatcaagaa cacgcctgtg 4140 

cccggaaata tcaccagctt ctcggacgtg cccgtcagca gcttcatcac ccagtacagc 4200 

accgggcagg tcaccgtgga gatggagtgg gagctcaaga aggaaaactc caagaggtgg 4260 

aacccagaga tccagtacac aaacaactac aacgaccccc agtttgtgga ctttgccccg 4320 

gacagcaccg gggaatacag aaccaccaga cctatcggaa cccgatacct tacccgaccc 4380 

ctttaaccca ttcatgtcgc ataccctcaa taaaccgtgt attcgtgtca gtaaaatact 4440 

gcctcttgtg gtcattcaat gaataacagc ttacaacatc tacaaaacct ccttgcttga 4500 

gagtgtggca ctctcccccc tgtcgcgttc gctcgctcgc tggctcgttt gggggggtgg 4560 

cagctcaaag agctgccaga cgacggccct ctggccgtcg cccccccaaa cgagccagcg 4620 

agcgagcgaa cgcgacaggg gggagagtgc ca 4652 

<210> 24 
<211> 390 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 
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Glu lvs Ser Leu Lys Arg Pro Leu Gly Asp Val Thr Asn Thr ser Tyr 

275 280 285 

Lvs ser Leu Glu Lys Arg Ala Arg Leu ser Phe val Pro Glu Thr Pro 

290 295 300 

Ara ser ser Asp Val Thr Val Asp Pro Ala Pro Leu Arg Pro Leu Asn 
305 310 315 n ^ u 320 

Trp Asn ser Arg Tyr Asp cys Lys cys Asp Tyr His Ala Gin Phe Asp 

325 330 335 

Asn lie ser Asn Lys Cys Asp Glu cys Glu Tyr Leu Asn Arg Gly Lys 

340 345 350 . 

Asn Gly Cys lie Cys His Asn Val Thr His cys Gin lie cys His Gly 

355 3 60 365 

lie Pro Pro Trp Glu Lys Glu Asn Leu ser Asp Phe Gly Asp Phe Asp 

370 375 380 

Asp Ala Asn Lys Glu Gin 
385 390 

<210> 25 
<211> 594 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

< 4 00> 25 w . . . 

Glu His Leu Pro Gly lie Ser Asp ser Phe val Asp Trp Val Thr Gly 

1 5 10 15 

Gin lie Trp Glu Leu Pro Pro Glu ser Asp Leu Asn Leu Thr Leu Val 

20 25 30 

Glu Gin Pro Gin Leu Thr val Ala Asp Arg lie Arg Arg Val Phe Leu 

35 40 45 

Tyr Glu Trp Asn Lys Phe Ser Lys Gin Glu Ser Lys Phe Phe val Gin 

50 55 60 

Phe Glu Lys Gly ser Glu Tyr Phe His Leu His Thr Leu val Glu Thr 
65 70 75 80 

Ser Gly lie Ser Ser Met Val Leu Gly Arg Tyr Val Ser Gin lie Arg 

85 90 95 

Ala Gin Leu Val Lys val val Phe Gin Gly lie Glu Pro Gin lie Asn 

100 105 110 

Asp Trp Val Ala lie Thr Lys val Lys Lys Gly Gly Ala Asn Lys Val 

115 120 125 

Val Asp ser Gly Tyr lie Pro Ala Tyr Leu Leu Pro Lys val Gin Pro 

130 135 140 

Glu Leu Gin Trp Ala Trp Thr Asn Leu Asp Glu Tyr Lys Leu Ala Ala 
145 150 155 160 

Leu Asn Leu Glu Glu Arg Lys Arg Leu Val Ala Gin Phe Leu Ala Glu 

165 ^ 170 17 5 

ser Ser Gin Arg ser Gin Glu Ala Ala ser Gin Arg Glu Phe ser Ala 

180 185 190 

Asp Pro val lie Lys Ser Lys Thr Ser Gin Lys Tyr Met Ala Leu val 

195 200 205 

Asn Trp Leu Val Glu His Gly lie Thr ser Glu Lys Gin Trp He Gin 

210 215 220 

Glu Asn Gin Glu Ser Tyr Leu ser Phe Asn Ser Thr Gly Asn ser Arg 
225 230 235 240 

ser Gin lie Lys Ala Ala Leu Asp Asn Ala Thr Lys lie Met ser Leu 

245 250 255 

Thr Lys ser Ala val Asp Tyr Leu val Gly Ser ser val Pro Glu Asp 

260 265 270 

lie Ser Lys Asn Arg lie Trp Gin lie Phe Glu Met Asn Gly Tyr Asp 
275 280 285 
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Pro Ala Tyr Ala Gly Ser lie Leu Tyr Gly Trp Cys Gin Arg Ser Phe 

290 295 300 

Asn Lys Arg Asn Thr val Trp Leu Tyr Gly pro Ala Thr Thr Gly Lys 
305 310 315 n 320 

Thr Asn lie Ala Glu Ala lie Ala His Thr Val Pro Phe Tyr Gly cys 

325 330 335 

Val Asn Trp Thr Asn Glu Asn Phe pro Phe Asn Asp Cys Val Asp Lys 

340 345 350 

Met Leu lie Trp Trp Glu Glu Gly Lys Met Thr Asn Lys Val Val Glu 

355 360 365 

ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg Val Asp Gin Lys 

370 375 380 

Cvs Lys ser ser val Gin lie Asp ser Thr Pro Val lie val Thr ser 
385 390 395 400 

Asn Thr Asn Met Cys Val val val Asp Gly Asn ser Thr Thr Phe Glu 

405 410 415 

His Gin Gin Pro Leu Glu Asp Arg Met Phe Lys Phe Glu Leu Thr Lys 

420 425 430 

Arq Leu Pro Pro Asp Phe Gly Lys lie Thr Lys Gin Glu Val Lys Asp 

435 440 445 

Phe Phe Ala Trp Ala Lys Val Asn Gin Val Pro Val Thr His Glu Phe 

450 455 460 

Lys val Pro Arg Glu Leu Ala Gly Thr Lys Gly Ala Glu Lys ser Leu 
465 470 475 480 

Lys Arq Pro Leu Gly Asp Val Thr Asn Thr ser Tyr Lys Ser Leu Glu 

485 490 495 

Lys Arg Ala Arg Leu Ser Phe Val Pro Glu Thr Pro Arg Ser Ser Asp 

500 505 510 

Val Thr val Asp Pro Ala Pro Leu Arg Pro Leu Asn Trp Asn Ser Arg 

515 520 525 

Tyr Asp Cys Lys Cys Asp Tyr His Ala Gin Phe Asp Asn lie ser Asn 

530 535 540 

Lys cys Asp Glu Cys Glu Tyr Leu Asn Arg Gly Lys Asn Gly cys lie 
545 550 555 560 

cys His Asn Val Thr His cys Gin lie Cys His Gly lie Pro Pro Trp 

565 570 ^ 575 

Glu Lys Glu Asn Leu ser Asp Phe Gly Asp Phe Asp Asp Ala Asn Lys 

580 585 590 

Glu Gin 



<210> 26 
<211> 724 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 26 

Met ser Phe val Asp His pro Pro Asp Trp Leu Glu Glu val Gly Glu 

1 5 10 15 

Gly Leu Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 

20 25 30 

Pro Asn Gin Gin His Gin Asp Gin Ala Arg Gly Leu val Leu Pro Gly 

35 40 45 

Tyr Asn Tyr Leu Gly Pro Gly Asn Gly Leu Asp Arg Gly Glu Pro Val 

50 55 60 

Asn Arg Ala Asp Glu Val Ala Arg Glu His Asp lie ser Tyr Asn Glu 
65 70 75 80 

Gin Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 

85 90 95 
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Ala Glu Phe Gin Glu Lys Leu Ala Asp Asp Thr ser Phe Gly Gly Asn 

100 105 110 

Leu Gly Lys Ala Val Phe Gin Ala Lys Lys Arg val Leu Glu Pro phe 

115 120 125 

Glv Leu val Glu Glu Gly Ala Lys Thr Ala Pro Thr Gly Lys Arg lie 

y 130 135 140 

asd asp His Phe Pro Lys Arg Lys Lys Ala Arg Thr Glu Glu Asp ser 
145 150 155 160 

Lvs pro ser Thr ser ser Asp Ala Glu Ala Gly Pro ser Gly Ser Gin 
J 165 170 175 

Gin Leu Gin lie Pro Ala Gin Pro Ala ser Ser Leu Gly Ala Asp Thr 

180 185 190 

Met ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gin Gly Ala 

195 200 205 

asp Gly val Gly Asn Ala ser Gly Asp Trp His cys Asp ser Thr Trp 

210 215 220 

Met Gly Asp Arg Val Val Thr Lys ser Thr Arg Thr Trp Val Leu Pro 
225 230 235 240 

ser Tyr Asn Asn His Gin Tyr Arg Glu lie Lys ser Gly Ser Val Asp 

245 250 255 

Gly ser Asn Ala Asn Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr 

260 265 270 

Phe Asp Phe Asn Arg Phe His Ser His Trp ser Pro Arg Asp Trp Gin 

275 280 285 

Arg Leu lie Asn Asn Tyr Trp Gly Phe Arg Pro Arg ser Leu Arg val 

290 295 300 

Lys lie Phe Asn He Gin Val Lys Glu Val Thr val Gin Asp ser Thr 
305 310 315 320 

Thr Thr lie Ala Asn Asn Leu Thr ser Thr Val Gin Val Phe Thr Asp 

325 330 . 335 

Asp Asp Tyr Gin Leu pro Tyr val val Gly Asn Gly. Thr Glu Gly Cys 

340 345 350 

Leu Pro Ala Phe Pro Pro Gin val Phe Thr Leu Pro Gin Tyr Gly Tyr 

355 360 _ 365 

Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser ser 

370 375 380 

Phe Phe Cys Leu Glu Tyr Phe Pro ser Lys Met Leu Arg Thr Gly Asn 
385 390 395 400 

Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu val Pro Phe His Ser Ser 

405 410 415 

Phe Ala pro Ser Gin Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 

420 425 430 

Gin Tyr Leu Tyr Arg Phe val Ser Thr Asn Asn Thr Gly Gly val Gin 

435 440 445 

Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 

450 455 460 

Phe Pro Gly Pro Met Gly Arg Thr Gin Gly Trp Asn Leu Gly Ser Gly 
465 470 475 480 

val Asn Arg Ala Ser val Ser Ala Phe Ala Thr Thr Asn Arg Met Glu 

485 490 495 

Leu Glu Gly Ala ser Tyr Gin val Pro Pro Gin Pro Asn Gly Met Thr 

500 505 510 

Asn Asn Leu Gin Gly ser Asn Thr Tyr Ala Leu Glu Asn Thr Met He 

515 520 525 

phe Asn ser Gin Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 

530 535 540 

Gly Asn Met Leu lie Thr ser Glu Ser Glu Thr Gin Pro val Asn Arq 
545 550 555 560 

val Ala Tyr Asn val Gly Gly Gin Met Ala Thr Asn Asn Gin ser ser 

565 570 575 

Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gin Glu lie Val Pro 

580 585 590 

Gly Ser val Trp Met Glu Arg Asp val Tyr Leu Gin Gly Pro He Trp 
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595 600 605 

Ala Lys lie Pro Glu Thr Gly Ala His Phe His Pro Ser Pro Ala Met 

610 615 620 

Gly Gly Phe Gly Leu Lys His Pro Pro pro Met Met Leu lie Lys Asn 
625 630 635 640 

Thr Pro Val Pro Gly Asn lie Thr Ser Phe ser Asp val Pro Val ser 

645 650 655 

ser Phe lie Thr Gin Tyr ser Thr Gly Gin val Thr val Glu Met Glu 

660 665 670 

Trp Glu Leu Lys Lys Glu Asn ser Lys Arg Trp Asn Pro Glu lie Gin 

675 * 680 685 

Tyr Thr Asn Asn Tyr Asn Asp Pro Gin Phe Val Asp Phe Ala Pro Asp 

690 695 700 

ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro lie Gly Thr Arg Tyr Leu 
705 710 715 720 

Thr Arg Pro Leu 

<210> 27 
<211> 588 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



15 

Thr Ser Ser Asp Ala 
30 

lie Pro Ala Gin Pro 
45 

Gly Gly Gly Gly Pro 
60 

Gly Asn Ala ser Gly 

80 

Arg Val Val Thr Lys 

95 

Asn His Gin Tyr Arg 
110 

Ala Asn Ala Tyr Phe 
125 

Asn Arg Phe His ser 
140 

Asn Asn Tyr Trp Gly 

160 

Asn lie Gin Val Lys 

175 

Ala Asn Asn Leu Thr 
190 

Gin Leu Pro Tyr Val 
205 

Phe Pro Pro Gin val 
220 

Asn Arg Asp Asn Thr 

240 

Leu Glu Tyr Phe Pro 

255 

Phe Thr Tyr Asn Phe 
270 



<400> 27 




Gly 






lie Asp 




His 


Thr 


Ala 


Pro 


Thr 


Lys 


Arg 


Asp 


1 














10 




Lys 


Ala 


Arg 


Thr 


Glu 


Glu 


Asp 


Ser Lys 


Pro 


ser 




20 








25 




Gin 


Glu 


Ala 


Gly 


Pro 


ser 


Gly 


Ser 


Gin Gin 


Leu 






35 








40 




Ala 


Ala 


Ser 


ser 


Leu 


Gly 


Ala 


Asp 


Thr Met 


Ser 




50 








55 




Gly 


val 


Leu 


Gly 


Asp 


Asn 


Asn 


Gin 


Gly 


Ala Asp 


65 






70 






Gly 


75 


Asp 


Trp 


His 


cys 


Asp 


ser 


Thr 


Trp Met 


Asp 




85 








90 




ser 


Thr 


Arg 


Thr 


Trp 


val 


Leu 


pro ser 


Tyr 


Asn 






100 






105 






Glu 


He 


Lys 


ser 


Gly 


Ser 


val 


Asp Gly 


Ser 


Asn 






115 








120 




Phe 


Gly 


Tyr 


Ser 


Thr 


Pro 


Trp 


Gly 


Tyr Phe 


Asp 


130 










135 


Gin Arg 




He 


His 


Trp 


Ser 


Pro 


Arg 


Asp 


Trp 


Leu 


145 






150 




val Lys 


He 


155 


Phe 


Arg 


Pro 


Arg 


ser 


Leu 


Arg 


Phe 






165 






170 


He 


Glu 


Val 


Thr 


val 


Gin 


Asp 


Ser 


Thr Thr 


Thr 








180 






185 






Ser 


Thr 


val 


Gin 


Val 


Phe 


Thr 


Asp Asp 


Asp 


Tyr 






195 










200 




Ala 


val 


Gly 


Asn 


Gly 


Thr 


Glu 


Gly 


cys Leu 


Pro 




210 








215 


Tyr Ala 


Thr 




Phe 


Thr 


Leu 


Pro 


Gin 


Tyr 


Gly 


Leu 


225 










230 


Phe 


235 


Glu 


Asn 


Pro 


Thr 


Glu 


Arg 


Ser 


Ser Phe 


Cys 










245 






250 


Glu 


Ser 


Lys 


Met 


Leu 


Arg 


Thr 


Gly 


Asn Asn 


Phe 






260 






265 


Ala 




Glu 


Glu 


val 


Pro 


Phe 


His 


ser 


Ser Phe 


Pro 
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275 280 285 

lvs Leu Ala Asn Pro Leu val Asp Gin Tyr Leu Tyr Arg Phe val ser 

290 295 300 

Thr Asn Asn Thr Gly Gly Val Gin Phe Asn Lys Asn Leu Ala Gly Arg 
305 310 315 320 

Tvr Ala Asn Thr Tyr Lys Asn Trp Phe Pro Gly Pro Met Gly Arg Thr 

325 330 335 

Gin Gly Trp Asn Leu Gly Ser Gly Val Asn Arg Ala ser val Ser Ala 

340 345 350 

Phe Ala Thr Thr Asn Arg Met Glu Leu Glu Gly Ala ser Tyr Gin val 

355 360 365 

Pro Pro Gin Pro Asn Gly Met Thr Asn Asn Leu Gin Gly Ser Asn Thr 

370 375 380 

Tvr Ala Leu Glu Asn Thr Met lie Phe Asn Ser Gin Pro Ala Asn Pro 
385 390 395 400 

Glv Thr Thr Ala Thr Tyr Leu Glu Gly Asn Met Leu lie Thr ser Glu 

y 405 410 415 

Ser Glu Thr Gin Pro Val Asn Arg Val Ala Tyr Asn Val Gly Gly Gin 

420 425 430 

Met Ala Thr Asn Asn Gin Ser Ser Thr Thr Ala Pro Ala Thr Gly Thr 

435 440 445 

Tyr Asn Leu Gin Glu lie Val Pro Gly ser val Trp Met Glu Arg Asp 

450 455 460 

Val Tyr Leu Gin Gly Pro lie Trp Ala Lys He Pro Glu Thr Gly Ala 
465 470 475 . 480 

His Phe His Pro ser Pro Ala Met Gly Gly Phe Gly Leu Lys His Pro 

485 490 495 

pro Pro Met Met Leu He Lys Asn Thr Pro Val Pro Gly Asn lie Thr 

500 505 510 

Ser Phe Ser Asp Val Pro val Ser Ser Phe lie Thr Gin Tyr Ser Thr 

515 520 525 

Gly Gin Val Thr val Glu Met Glu Trp Glu Leu Lys Lys Glu Asn Ser 

530 535 540 

Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Asn Asn Tyr Asn Asp Pro 
545 " 550 555 560 

Gin Phe val Asp Phe Ala Pro Asp ser Thr Gly Glu Tyr Arg Thr Thr 

565 570 575 

Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 

580 585 

<210> 28 
<211> 532 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

Met Ser Ala Gly Gly Gly Gly Pro Leu Gly Asp Asn Asn Gin Gly Ala 

1 5 10 15 

Asp Gly val Gly Asn Ala ser Gly Asp Trp His Cys Asp ser Thr Trp 

20 25 30 

Met Gly Asp Arg Val val Thr Lys ser Thr Arg Thr Trp val Leu Pro 

35 40 45 

Ser Tyr Asn Asn His Gin Tyr Arg Glu He Lys ser Gly Ser val Asp 

50 55 60 

Gly ser Asn Ala Asn Ala Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr 
65 70 .75 80 

Phe Asp Phe Asn Arg Phe His Ser His Trp Ser Pro Arg Asp Trp Gin 

85 90 95 

Arg Leu He Asn Asn Tyr Trp Gly Phe Arg Pro Arg Ser Leu Arg val 
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100 105 110 

Lvs lie Phe Asn lie Gin Val Lys Glu val Thr val Gin Asp ser Thr 

115 120 125 

Thr Thr lie Ala Asn Asn Leu Thr Ser Thr val Gin val Phe Thr Asp 

130 135 140 

asd Asp Tyr Gin Leu Pro Tyr Val Val Gly Asn Gly Thr Glu Gly cys 
145 150 155 160 

Leu Pro Ala Phe Pro Pro Gin val Phe Thr Leu Pro Gin Tyr Gly Tyr 

165 170 175 

Ala Thr Leu Asn Arg Asp Asn Thr Glu Asn Pro Thr Glu Arg Ser Ser 

180 185 190 

Phe Phe Cys Leu Glu Tyr Phe Pro Ser Lys Met Leu Arg Thr Gly Asn 

195 200 205 

Asn Phe Glu Phe Thr Tyr Asn Phe Glu Glu Val Pro Phe His ser Ser 

210 215 220 

Phe Ala Pro ser Gin Asn Leu Phe Lys Leu Ala Asn Pro Leu Val Asp 
225 230 235 240 

Gin Tyr Leu Tyr Arg Phe val ser Thr Asn Asn Thr Gly Gly Val Gin 

245 250 255 

Phe Asn Lys Asn Leu Ala Gly Arg Tyr Ala Asn Thr Tyr Lys Asn Trp 

260 265 270 

Phe Pro Gly Pro Met Gly Arg Thr Gin Gly Trp Asn Leu Gly ser Gly 

275 280 285 

Val Asn Arg Ala Ser Val ser Ala Phe Ala Thr Thr Asn Arg Met Glu 

290 295 300 

Leu Glu Gly Ala Ser Tyr Gin val Pro Pro Gin Pro Asn Gly Met Thr 
305 310 315 320 

Asn Asn Leu Gin Gly Ser Asn Thr Tyr Ala Leu Glu Asn Thr Met He 

325 330 335 

phe Asn Ser Gin Pro Ala Asn Pro Gly Thr Thr Ala Thr Tyr Leu Glu 

340 345 350 

Gly Asn Met Leu lie Thr ser Glu Ser Glu Thr Gin Pro Val Asn Arg 

355 360 365 

val Ala Tyr Asn val Gly Gly Gin Met Ala Thr Asn Asn Gin ser Ser 

370 375 380 

Thr Thr Ala Pro Ala Thr Gly Thr Tyr Asn Leu Gin Glu lie val Pro 
385 390 395 400 

Gly ser val Trp Met Glu Arg Asp val Tyr Leu Gin Gly Pro lie Trp 

y 405 410 415 

Ala Lys lie Pro Glu Thr Gly Ala His Phe His Pro ser Pro Ala Met 

420 425 430 

Gly Gly Phe Gly Leu Lys His Pro Pro Pro Met Met Leu lie Lys Asn 

435 440 445 

Thr Pro val Pro Gly Asn lie Thr Ser Phe ser Asp Val Pro val Ser 

450 455 460 

Ser Phe lie Thr Gin Tyr ser Thr Gly Gin Val Thr val Glu Met Glu 
465 470 475 480 

Trp Glu Leu Lys Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin 

485 490 495 

Tyr Thr Asn Asn Tyr Asn Asp Pro Gin Phe val Asp Phe Ala Pro Asp 

7 500 505 510 

ser Thr Gly Glu Tyr Arg Thr Thr Arg Pro lie Gly Thr Arg Tyr Leu 

515 520 525 

Thr Arg Pro Leu 
530 

<210> 29 
<211> 2307 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
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synthetic construct 
<400> 29 

aggctctcat ttgttcccga gacgcctcgc agttcagacg tgactgttga tcccgctcct 60 

ctgcgaccgc tcaattggaa ttcaagtaaa taaagcgagt agtcatgtct tttgttgatc 120 

accctccaga ttggttggaa gaagttggtg aaggtcttcg cgagtttttg ggccttgaag 180 

cgggcccacc gaaaccaaaa cccaatcagc agcatcaaga tcaagcccgt ggtcttgtgc 240 

tgcctggtta taactatctc ggacccggaa acggtctcga tcgaggagag cctgtcaaca 300 

gggcagacga ggtcgcgcga gagcacgaca tctcgtacaa cgagcagctt gaggcgggag 360 

acaaccccta cctcaagtac aaccacgcgg acgccgagtt tcaggagaag ctcgccgacg 420 

acacatcctt cgggggaaac ctcggaaagg cagtctttca ggccaagaaa agggttctcg 480 

aaccttttgg cctggttgaa gagggtgcta agacggcccc taccggaaag cggatagacg 540 

accactttcc aaaaagaaag aaggctcgga ccgaagagga ctccaagcct tccacctcgt 600 

cagacgccga agctggaccc agcggatccc agcagctgca aatcccagcc caaccagcct 660 

caagtttggg agctgataca atgtctgcgg gaggtggcgg cccattgggc gacaataacc 720 

aaggtgccga tggagtgggc aatgcctcgg gagattggca ttgcgattcc acgtggatgg 780 

gggacagagt cgtcaccaag tccacccgaa cctgggtgct gcccagctac aacaaccacc 840 

agtaccgaga gatcaaaagc ggctccgtcg acggaagcaa cgccaacgcc tactttggat 900 

acagcacccc ctgggggtac tttgacttta accgcttcca cagccactgg agcccccgag 960 

actggcaaag actcatcaac aactactggg gcttcagacc ccggtccctc agagtcaaaa 1020 

tcttcaacat tcaagtcaaa gaggtcacgg tgcaggactc caccaccacc atcgccaaca 1080 

acctcacctc caccgtccaa gtgtttacgg acgacgacta ccagctgccc tacgtcgtcg 1140 

gcaacgggac cgagggatgc ctgccggcct tccctccgca ggtctttacg ctgccgcagt 1200 

acggttacgc gacgctgaac cgcgacaaca cagaaaatcc caccgagagg agcagcttct 1260 

tctgcctaga gtactttccc agcaagatgc tgagaacggg caacaacttt gagtttacct 1320 

acaactttga ggaggtgccc ttccactcca gcttcgctcc cagtcagaac ctgttcaagc 1380 

tggccaaccc gctggtggac cagtacttgt accgcttcgt gagcacaaat aacactggcg 1440 

gagtccagtt caacaagaac ctggccggga gatacgccaa cacctacaaa aactggttcc 1500 

cggggcccat gggccgaacc cagggctgga acctgggctc cggggtcaac cgcgccagtg 1560 

tcagcgcctt cgccacgacc aataggatgg agctcgaggg cgcgagttac caggtgcccc 1620 

cgcagccgaa cggcatgacc aacaacctcc agggcagcaa cacctatgcc ctggagaaca 1680 

ctatgatctt caacagccag ccggcgaacc cgggcaccac cgccacgtac ctcgagggca 1740 

acatgctcat caccagcgag agcgagacgc agccggtgaa ccgcgtggcg tacaacgtcg 1800 

gcgggcagat ggccaccaac aaccagagct ccaccactgc ccccgcgacc ggcacgtaca 1860 

acctccagga aatcgtgccc ggcagcgtgt ggatggagag ggacgtgtac ctccaaggac 1920 

ccatctgggc caagatccca gagacggggg cgcactttca cccctctccg gccatgggcg 1980 

gattcggact caaacaccca ccgcccatga tgctcatcaa gaacacgcct gtgcccggaa 2040 

atatcaccag cttctcggac gtgcccgtca gcagcttcat cacccagtac agcaccgggc 2100 

aggtcaccgt ggagatggag tgggagctca agaaggaaaa ctccaagagg tggaacccag 2160 

agatccagta cacaaacaac tacaacgacc cccagtttgt ggactttgcc ccggacagca 2220 

ccggggaata cagaaccacc agacctatcg gaacccgata ccttacccga cccctttaac 2280 

ccattcatgt cgcataccct caataaa 2307 

<210> 30 
<211> 2264 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 30 

aggctctcat ttgttcccga gacgcctcgc agttcagacg tgactgttga tcccgctcct 60 

ctgcgaccgc tcaattggaa ttcaagattg gttggaagaa gttggtgaag gtcttcgcga 120 

gtttttgggc cttgaagcgg gcccaccgaa accaaaaccc aatcagcagc atcaagatca 180 

agcccgtggt cttgtgctgc ctggttataa ctatctcgga cccggaaacg gtctcgatcg 240 

aggagagcct gtcaacaggg cagacgaggt cgcgcgagag cacgacatct cgtacaacga 300 

gcagcttgag gcgggagaca acccctacct caagtacaac cacgcggacg ccgagtttca 360 

ggagaagctc gccgacgaca catccttcgg gggaaacctc ggaaaggcag tctttcaggc 420 

caagaaaagg gttctcgaac cttttggcct ggttgaagag ggtgctaaga cggcccctac 480 

cggaaagcgg atagacgacc actttccaaa aagaaagaag gctcggaccg aagaggactc 540 

caagccttcc acctcgtcag acgccgaagc tggacccagc ggatcccagc agctgcaaat 600 
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cccagcccaa ccagcctcaa gtttgggagc tgatacaatg tctgcgggag gtggcggccc 660 

attgggcgac aataaccaag gtgccgatgg agtgggcaat gcctcgggag attggcattg 720 

cgattccacg tggatggggg acagagtcgt caccaagtcc acccgaacct gggtgctgcc 780 

cagctacaac aaccaccagt accgagagat caaaagcggc tccgtcgacg gaagcaacgc 840 

caacgcctac tttggataca gcaccccctg ggggtacttt gactttaacc gcttccacag 900 

ccactggagc ccccgagact ggcaaagact catcaacaac tactggggct tcagaccccg 960 

gtccctcaga gtcaaaatct tcaacattca agtcaaagag gtcacggtgc aggactccac 1020 
caccaccatc gccaacaacc tcacctccac cgtccaagtg tttacggacg acgactacca , 1080 

gctgccctac gtcgtcggca acgggaccga gggatgcctg ccggccttcc ctccgcaggt 1140 

ctttacgctg ccgcagtacg gttacgcgac gctgaaccgc gacaacacag aaaatcccac 1200 

cgagaggagc agcttcttct gcctagagta ctttcccagc aagatgctga gaacgggcaa 1260 

caactttgag tttacctaca actttgagga ggtgcccttc cactccagct tcgctcccag 1320 

tcagaacctg ttcaagctgg ccaacccgct ggtggaccag tacttgtacc gcttcgtgag 1380 

cacaaataac actggcggag tccagttcaa caagaacctg gccgggagat acgccaacac 1440 

ctacaaaaac tggttcccgg ggcccatggg ccgaacccag ggctggaacc tgggctccgg 1500 

ggtcaaccgc gccagtgtca gcgccttcgc cacgaccaat aggatggagc tcgagggcgc 1560 

gagttaccag gtgcccccgc agccgaacgg catgaccaac aacctccagg gcagcaacac 1620 

ctatgccctg gagaacacta tgatcttcaa cagccagccg gcgaacccgg gcaccaccgc 1680 

cacgtacctc gagggcaaca tgctcatcac cagcgagagc gagacgcagc cggtgaaccg 1740 

cgtggcgtac aacgtcggcg ggcagatggc caccaacaac cagagctcca ccactgcccc 1800 

cgcgaccggc acgtacaacc tccaggaaat cgtgcccggc agcgtgtgga tggagaggga 1860 

cgtgtacctc caaggaccca tctgggccaa gatcccagag acgggggcgc actttcaccc 1920 

ctctccggcc atgggcggat tcggactcaa acacccaccg cccatgatgc tcatcaagaa 1980 

cacgcctgtg cccggaaata tcaccagctt ctcggacgtg cccgtcagca gcttcatcac 2040 

ccagtacagc accgggcagg tcaccgtgga gatggagtgg gagctcaaga aggaaaactc 2100 

caagaggtgg aacccagaga tccagtacac aaacaactac aacgaccccc agtttgtgga zibo 

ctttgccccg gacagcaccg gggaatacag aaccaccaga cctatcggaa cccgatacct 2220 

tacccgaccc ctttaaccca ttcatgtcgc ataccctcaa taaa 2264 

<210> 31 
<211> 2264 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 31 ^ ^ 

aggctctcat ttgttcccga gacgcctcgc agttcagacg tgactgttga tcccgctcct 60 

ctgcgaccgc tcaattggaa ttcaagattg gttggaagaa gttggtgaag gtcttcgcga 120 

gtttttgggc cttgaagcgg gcccaccgaa accaaaaccc aatcagcagc atcaagatca 180 

agcccgtggt cttgtgctgc ctggttataa ctatctcgga cccggaaacg gtctcgatcg 240 

aggagagcct gtcaacaggg cagacgaggt cgcgcgagag cacgacatct cgtacaacga 300 

gcagcttgag gcgggagaca acccctacct caagtacaac cacgcggacg ccgagtttca 360 

ggagaagctc gccgacgaca catccttcgg gggaaacctc ggaaaggcag tctttcaggc 420 

caagaaaagg gttctcgaac cttttggcct ggttgaagag ggtgctaaga cggcccctac 480 

cggaaagcgg atagacgacc actttccaaa aagaaagaag gctcggaccg aagaggactc 540 

caagccttcc acctcgtcag acgccgaagc tggacccagc ggatcccagc agctgcaaat 600 

cccagcccaa ccagcctcaa gtttgggagc tgatacaatg tctgcgggag gtggcggccc 660 

attgggcgac aataaccaag gtgccgatgg agtgggcaat gcctcgggag attggcattg 720 

cgattccacg tggatggggg acagagtcgt caccaagtcc acccgaacct gggtgctgcc 780 

900 
960 



cagctacaac aaccaccagt accgagagat caaaagcggc tccgtcgacg gaagcaacgc 

caacgcctac tttggataca gcaccccctg ggggtacttt gactttaacc gcttccacag 

ccactggagc ccccgagact ggcaaagact catcaacaac tactggggct tcagaccccg 

gtccctcaga gtcaaaatct tcaacattca agtcaaagag gtcacggtgc aggactccac 1020 

caccaccatc gccaacaacc tcacctccac cgtccaagtg tttacggacg acgactacca 1080 

gctgccctac gtcgtcggca acgggaccga gggatgcctg ccggccttcc ctccgcaggt 1140 

ctttacgctg ccgcagtacg gttacgcgac gctgaaccgc gacaacacag aaaatcccac 1200 

cgagaggagc agcttcttct gcctagagta ctttcccagc aagatgctga gaacgggcaa 1260 

caactttgag tttacctaca actttgagga ggtgcccttc cactccagct tcgctcccag 1320 

tcagaacctg ttcaagctgg ccaacccgct ggtggaccag tacttgtacc gcttcgtgag 1380 

cacaaataac actggcggag tccagttcaa caagaacctg gccgggagat acgccaacac 1440 
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ctacaaaaac tggttcccgg ggcccatggg ccgaacccag ggctggaacc tgggctccgg 1500 

ggtcaaccgc gccagtgtca gcgccttcgc cacgaccaat aggatggagc tcgagggcgc 1560 

gagttaccag gtgcccccgc agccgaacgg catgaccaac aacctccagg gcagcaacac 1620 

ctatgccctg gagaacacta tgatcttcaa cagccagccg gcgaacccgg gcaccaccgc 1680 

cacgtacctc gagggcaaca tgctcatcac cagcgagagc gagacgcagc cggtgaaccg 1740 

cgtggcgtac aacgtcggcg ggcagatggc caccaacaac cagagctcca ccactgcccc 1800 

cgcgaccggc acgtacaacc tccaggaaat cgtgcccggc agcgtgtgga tggagaggga 1860 

cgtgtacctc caaggaccca tctgggccaa gatcccagag acgggggcgc actttcaccc 1920 

ctctccggcc atgggcggat tcggactcaa acacccaccg cccatgatgc tcatcaagaa 1980 

cacgcctgtg cccggaaata tcaccagctt ctcggacgtg cccgtcagca gcttcatcac 2040 

ccagtacagc accgggcagg tcaccgtgga gatggagtgg gagctcaaga aggaaaactc 2100 

caagaggtgg aacccagaga tccagtacac aaacaactac aacgaccccc agtttgtgga 2160 

ctttgccccg gacagcaccg gggaatacag aaccaccaga cctatcggaa cccgatacct 2220 

tacccgaccc ctttaaccca ttcatgtcgc ataccctcaa taaa 2264 

<210> 32 
<211> 1292 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 32 

agcgcaaacg gctcgtcgcg cagtttctgg cagaatcctc gcagcgctcg caggaggcgg 60 

cttcgcagcg tgagttctcg gctgacccgg tcatcaaaag caagacttcc cagaaataca 120 

tggcgctcgt caactggctc gtggagcacg gcatcacttc cgagaagcag tggatccagg 180 

aaaatcagga gagctacctc tccttcaact ccaccggcaa ctctcggagc cagatcaagg 240 

ccgcgctcga caacgcgacc aaaattatga gtctgacaaa aagcgcggtg gactacctcg 300 

tggggagctc cgttcccgag gacatttcaa aaaacagaat ctggcaaatt tttgagatga 360 

atggctacga cccggcctac gcgggatcca tcctctacgg ctggtgtcag cgctccttca 420 

acaagaggaa caccgtctgg ctctacggac ccgccacgac cggcaagacc aacatcgcgg 480 

aggccatcgc ccacactgtg cccttttacg gctgcgtgaa ctggaccaat gaaaactttc 540 

cctttaatga ctgtgtggac aaaatgctca tttggtggga ggagggaaag atgaccaaca 600 

aggtggttga atccgccaag gccatcctgg ggggctcaaa ggtgcgggtc gatcagaaat 660 

gtaaatcctc tgttcaaatt gattctaccc ctgtcattgt aacttccaat acaaacatgt 720 

gtgtggtggt ggatgggaat tccacgacct ttgaacacca gcagccgctg gaggaccgca 780 

tgttcaaatt tgaactgact aagcggctcc cgccagattt tggcaagatt actaagcagg 840 

aagtcaagga cttttttgct tgggcaaagg tcaatcaggt gccggtgact cacgagttta 900 

aagttcccag ggaattggcg ggaactaaag gggcggagaa atctctaaaa cgcccactgg 960 

gtgacgtcac caatactagc tataaaagtc tggagaagcg ggccaggctc tcatttgttc 1020 

ccgagacgcc tcgcagttca gacgtgactg ttgatcccgc tcctctgcga ccgctcaatt 1080 

ggaattcaag gtatgattgc aaatgtgact atcatgctca atttgacaac atttctaaca 1140 

aatgtgatga atgtgaatat ttgaatcggg gcaaaaatgg atgtatctgt cacaatgtaa 1200 

ctcactgtca aatttgtcat gggattcccc cctgggaaaa ggaaaacttg tcagattttg 1260 

gggattttga cgatgccaat aaagaacagt aa 1292 

<210> 33 
<211> 1870 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 33 ^ , cn 

attctttgct ctggactgct agaggaccct cgctgccatg gctaccttct atgaagtcat 60 

tgttcgcgtc ccatttgacg tggaggaaca tctgcctgga atttctgaca gctttgtgga 120 

ctgggtaact ggtcaaattt gggagctgcc tccagagtca gatttaaatt tgactctggt 180 

tgaacagcct cagttgacgg tggctgatag aattcgccgc gtgttcctgt acgagtggaa 240 

caaattttcc aagcaggagt ccaaattctt tgtgcagttt gaaaagggat ctgaatattt 300 
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<210> 34 
<211> 330 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 34 
















Gly 


He 


Thr 




Glu 




Met Ala 


Leu 


Val 


Asn 


Trp 


Leu 


val 


Glu 


His 


ser 


Lys 


1 






5 








10 




Phe 




15 


Thr 


Gin Trp 


He 


Gin 


Glu 


Asn 


Gin 


Glu 


Ser 


Tyr 


Leu 


Ser 


Asn 


Ser 




20 










25 








30 


Thr 




Gly Asn 


ser 


Arg 


ser 


Gin 


lie 


Lys 


Ala 


Ala 


Leu 


Asp 


Asn 


Ala 


Lys 


35 








40 










45 


Gly 






He Met 


ser 


Leu 


Thr 


Lys 


ser 


Ala 


val 


Asp 


Tyr 


Leu 


val 


Ser 


Ser 


50 








55 








60 


He 


Phe 


Glu 




val Pro 


Glu 


Asp 


lie 


Ser 


Lys 


Asn 


Arg 


lie 


Trp 


Gin 


Met 


65 






70 








75 






Gly 




80 


Asn Gly 


Tyr 


Asp 


Pro 


Ala 


Tyr 


Ala 


Gly 


Ser 


lie 


Leu 


Tyr 


Trp 


Cys 


85 








90 








Gly 


95 


Ala 


Gin Arg 


ser 


Phe 


Asn 


Lys 


Arg 


Asn 


Thr 


Val 


Trp 


Leu 


Tyr 


Pro 




100 






105 










110 


Val 




Thr Thr 


Gly 


Lys 


Thr 


Asn 


He 


Ala 


Glu 


Ala 


He 


Ala 


His 


Thr 


Pro 




115 








120 








Phe 


125 


■ 

Phe 






Phe Tyr 


Gly 


Cys 


val 


Asn 


Trp 


Thr 


Asn 


Glu 


Asn 


Pro 


Asn 


Asp 


130 






135 








Glu 


140 






Thr 




Cys Val 


Asp 


Lys 


Met 


Leu 


He 


Trp 


Trp 


Glu 


Gly 


Lys 


Met 


Asn 


145 




150 










155 


Gly 






val 


160 


Lys val 


Val 


Glu 


ser 


Ala 


Lys 


Ala 


lie 


Leu 


Gly 


Ser 


Lys 


Arg 






165 








170 






Thr 


175 


val 


val Asp 


Gin 


Lys 


cys 


Lys 


ser 


ser 


val 


Gin 


He 


Asp 


Ser 


Pro 




180 






185 






val 




190 






lie val 


Thr 


Ser 


Asn 


Thr 


Asn 


Met 


Cys 


val 


val 


Asp 


Gly 


Asn 


Ser 



tcatctgcac acgcttgtgg agacctccgg catcrcttcc atggncctcg gccgctacgt 360 

gagtcagatt cgcgcccagc tggtgaaagt ggtcttccag ggaattgaac cccagatcaa 420 

cgactgggtc gccatcacca aggtaaagaa gggcggagcc aataaggtgg tggattctgg 480 

gtatattccc gcctacctgc tgccgaaggt ccaaccggag cttcagtggg cgtggacaaa 540 

cctggacgag tataaattgg ccgccctgaa tctggaggag cgcaaacggc tcgtcgcgca 600 

gtttctggca gaatcctcgc agcgctcgca ggaggcggct tcgcagcgtg agttctcggc 660 

tgacccggtc atcaaaagca agacttccca gaaatacatg gcgctcgtca actggctcgt 720 

ggagcacggc atcacttccg agaagcagtg gatccaggaa aatcaggaga gctacctctc 780 

cttcaactcc accggcaact ctcggagcca gatcaaggcc gcgctcgaca acgcgaccaa 840 

aattatgagt ctgacaaaaa gcgcggtgga ctacctcgtg gggagctccg ttcccgagga 900 

catttcaaaa aacagaatct ggcaaatttt tgagatgaat ggctacgacc cggcctacgc 960 

gggatccatc ctctacggct ggtgtcagcg ctccttcaac aagaggaaca ccgtctggct 1020 

ctacggaccc gccacgaccg gcaagaccaa catcgcggag gccatcgccc acactgtgcc 1080 

cttttacggc tgcgtgaact ggaccaatga aaactttccc tttaatgact gtgtggacaa 1140 

aatgctcatt tggtgggagg agggaaagat gaccaacaag gtggttgaat ccgccaaggc 1200 

catcctgggg ggctcaaagg tgcgggtcga tcagaaatgt aaatcctctg ttcaaattga 1260 

ttctacccct gtcattgtaa cttccaatac aaacatgtgt gtggtggtgg atgggaattc 1320 

cacgaccttt gaacaccagc agccgctgga ggaccgcatg ttcaaatttg aactgactaa 1380 

gcggctcccg ccagattttg gcaagattac taagcaggaa gtcaaggact tttttgcttg 1440 

ggcaaaggtc aatcaggtgc cggtgactca cgagtttaaa gttcccaggg aattggcggg 1500 

aactaaaggg gcggagaaat ctctaaaacg cccactgggt gacgtcacca atactagcta 1560 

taaaagtctg gagaagcggg ccaggctctc atttgttccc gagacgcctc gcagttcaga 1620 

cgtgactgtt gatcccgctc ctctgcgacc gctcaattgg aattcaaggt atgattgcaa 1680 

atgtgactat catgctcaat ttgacaacat ttctaacaaa tgtgatgaat gtgaatattt 1740 

gaatcggggc aaaaatggat gtatctgtca caatgtaact cactgtcaaa tttgtcatgg 1800 

gattcccccc tgggaaaagg aaaacttgtc agattttggg gattttgacg atgccaataa 1860 
agaacagtaa 
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195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Glu Asp Arg Met Phe Lys Phe 

210 215 220 s 

Glu Leu Thr Lys Arg Leu Pro Pro Asp Phe Giy Lys lie Thr Lys Gin 

225 230 235 240 

Glu val Lys Asp Phe Phe Ala Trp Ala Lys val Asn Gin Val Pro val 

Thr His Glu Phe Lys Val Pro Arg Glu Leu Ala Giy Thr Lys Giy Ala . 

260 265 270 

Glu Lys ser Leu Lys Arg Pro Leu Giy Asp Val Thr Asn Thr Ser Tyr 

275 280 285 

Lys ser Leu Glu Lys Arg Ala Arg Leu ser Phe Val Pro Glu Thr Pro 

290 295 300 

Arg ser Ser Asp val Thr val Asp Pro Ala Pro Leu Arg Pro Leu Asn 
305 310 315 320 

Trp Asn ser Arg Leu val Giy Arg Ser Trp 

325 330 

<210> 35 
<211> 1115 
<212> DNA 

<213> Artificial sequence 
^2 20^ 

<223> Description of Artificial sequence; note = 
synthetic construct 

aggagcgcaa acggctcgtc gcgcagtttc tggcagaatc ctcgcagcgc tcgcaggagg 60 

cggcttcgca gcgtgagttc tcggctgacc cggtcatcaa aagcaagact tcccagaaat 120 

acatggcgct cgtcaactgg ctcgtggagc acggcatcac ttccgagaag cagtggatcc 180 

aggaaaatca ggagagctac ctctccttca actccaccgg caactctcgg agccagatca 240 

aggccgcgct cgacaacgcg accaaaatta tgagtctgac aaaaagcgcg gtggactacc 300 

tcgtggggag ctccgttccc gaggacattt caaaaaacag aatctggcaa atttttgaga 360 

tgaatggcta cgacccggcc tacgcgggat ccatcctcta cggctggtgt cagcgctcct 420 

tcaacaigag gaacaccgtc tggctctacg gacccgccac gaccggcaag accaacatcg 480 

cggaggccat cgcccacact gtgccctttt acggctgcgt gaactggacc aatgaaaact 540 

ttccctttaa tgactgtgtg gacaaaatgc tcatttggtg ggaggaggga aagatgacca 600 

acaaggtggt tgaatccgcc aaggccatcc tggggggctc aaaggtgcgg gtcgatcaga 660 

aatgtaaitc ctctgttcaa attgattcta cccctgtcat tgtaacttcc aatacaaaca 720 

tgtgtgtggt ggtggatggg aattccacga cctttgaaca ccagcagccg ctggaggacc 780 

gcatgttcaa atttgaactg actaagcggc tcccgccaga ttttggcaag attactaagc 840 

aggaagtcaa ggactttttt gcttgggcaa aggtcaatca ggtgccggtg actcacgagt 900 

ttaaagttcc cagggaattg gcgggaacta aaggggcgga gaaatctcta aaacgcccac 960 

tgggtgacgt caccaatact agctataaaa gtctggagaa gcgggccagg ctctcatttg 1020 

ttcccgagac gcctcgcagt tcagacgtga ctgttgatcc cgctcctctg cgaccgctca 1080 
attggaattc aagattggtt ggaagaagtt ggtga 

<210> 36 
<211> 550 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

Met°Ala 6 Thr Phe Tyr Glu val He val Arg val Pro Phe Asp Val Glu 
1 5 10 15 

Glu His Leu Pro Giy lie Ser Asp Ser Phe Val Asp Trp Val Thr Giy 

20 25 30 

Gin lie Trp Glu Leu Pro Pro Glu Ser Asp Leu Asn Leu Thr Leu val 
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35 40 45 

Glu Gin Pro Gin Leu Thr Val Ala Asp Arg lie Arg Arg val Phe Leu 

50 55 60 

Tvr Glu Trp Asn Lys Phe Ser Lys Gin Glu Ser Lys Phe Phe val Gin 
65 70 75 80 

Phe Glu. Lys Gly Ser Glu Tyr Phe His Leu His Thr Leu Val Glu Thr 

85 90 95 

ser Gly lie ser Ser Met Val Leu Gly Arg Tyr Val Ser Gin lie Arg 

100 105 110 

Ala Gin Leu Val Lys Val Val Phe Gin Gly He Glu Pro Gin lie Asn 

115 120 125 

Asp Trp val Ala lie Thr Lys Val Lys Lys Gly Gly Ala Asn Lys val 

130 135 140 

Val Asp ser Gly Tyr lie Pro Ala Tyr Leu Leu Pro Lys Val Gin Pro 
145 150 155 160 

Glu Leu Gin Trp Ala Trp Thr Asn Leu Asp Glu Tyr Lys Leu Ala Ala 

165 170 175 

Leu Asn Leu Glu Glu Arg Lys Arg Leu Val Ala Gin Phe Leu Ala Glu 

180 185 190 

Ser ser Gin Arg Ser Gin Glu Ala Ala ser Gin Arg Glu Phe Ser Ala 

195 200 205 

Asp Pro Val lie Lys Ser Lys Thr Ser Gin Lys Tyr Met Ala Leu val 

210 215 220 

Asn Trp Leu val Glu His Gly lie Thr Ser Glu Lys Gin Trp He Gin 
225 230 235 240 

Glu Asn Gin Glu Ser Tyr Leu Ser Phe Asn ser Thr Gly Asn Ser Arg 

245 250 255 

ser Gin lie Lys Ala Ala Leu Asp Asn Ala Thr Lys lie Met ser Leu 

260 265 270 

Thr Lys Ser Ala Val Asp Tyr Leu Val Gly Ser ser val Pro Glu Asp 

275 280 285 

lie Ser Lys Asn Arg lie Trp Gin lie Phe Glu Met Asn Gly Tyr Asp 

290 " 295 300 

pro Ala Tyr Ala Gly Ser lie Leu Tyr Gly Trp cys Gin Arg ser Phe 
305 310 315 320 

Asn Lys Arg Asn Thr val Trp Leu Tyr Gly Pro Ala Thr Thr Gly Lys 

325 330 335 

Thr Asn lie Ala Glu Ala lie Ala His Thr Val Pro Phe Tyr Gly Cys 

340 345 350 

val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp cys Val Asp Lys 

355 360 365 

Met Leu lie Trp Trp Glu Glu Gly Lys Met Thr Asn Lys val val Glu 

370 375 * 380 

ser Ala Lys Ala lie Leu Gly Gly Ser Lys Val Arg val Asp Gin Lys 
385 390 395 400 

Cys Lys ser ser val Gin lie Asp Ser Thr Pro val He val Thr Ser 

405 410 415 

Asn Thr Asn Met Cys Val val val Asp Gly Asn Ser Thr Thr Phe Glu 

420 425 430 

His Gin Gin Pro Leu Glu Asp Arg Met Phe Lys Phe Glu Leu Thr Lys 

435 440 445 

Arg Leu Pro Pro Asp Phe Gly Lys lie Thr Lys Gin Glu val Lys Asp 

450 455 460 

Phe Phe Ala Trp Ala Lys Val Asn Gin val Pro Val Thr His Glu Phe 
465 470 475 480 

Lys val Pro Arg Glu Leu Ala Gly Thr Lys Gly Ala Glu Lys ser Leu 

485 490 495 

Lys Arg pro Leu Gly Asp val Thr Asn Thr ser Tyr Lys ser Leu Glu 

500 505 510 

Lys Arg Ala Arg Leu Ser Phe val Pro Glu Thr Pro Arg Ser Ser Asp 

515 520 525 

val Thr Val Asp Pro Ala Pro Leu Arg Pro Leu Asn Trp Asn Ser Arg 
530 535 540 
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Leu Val Gly Arg Ser Trp 
545 550 

<210> 37 
<211> 1690 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 37 

attctttgct ctggactgct agaggaccct cgctgccatg gctaccttct atgaagtcat 60 

tgttcgcgtc ccatttgacg tggaggaaca tctgcctgga atttctgaca gctttgtgga 120 

ctgggtaact ggtcaaattt gggagctgcc tccagagtca gatttaaatt tgactctggt 180 

tgaacagcct cagttgacgg tggctgatag aattcgccgc gtgttcctgt acgagtggaa 240 

caaattttcc aagcaggagt ccaaattctt tgtgcagttt gaaaagggat ctgaatattt 300 

tcatctgcac acgcttgtgg agacctccgg catctcttcc atggtcctcg gccgctacgt 360 

gagtcagatt cgcgcccagc tggtgaaagt ggtcttccag ggaattgaac cccagatcaa 420 

cgactgggtc gccatcacca aggtaaagaa gggcggagcc aataaggtgg tggattctgg 480 

gtatattccc gcctacctgc tgccgaaggt ccaaccggag cttcagtggg cgtggacaaa 540 

cctggacgag tataaattgg ccgccctgaa tctggaggag cgcaaacggc tcgtcgcgca 600 

gtttctggca gaatcctcgc agcgctcgca ggaggcggct tcgcagcgtg agttctcggc 660 

tgacccggtc atcaaaagca agacttccca gaaatacatg gcgctcgtca actggctcgt 720 

ggagcacggc atcacttccg agaagcagtg gatccaggaa aatcaggaga gctacctctc 780 

cttcaactcc accggcaact ctcggagcca gatcaaggcc gcgctcgaca acgcgaccaa 840 

aattatgagt ctgacaaaaa gcgcggtgga ctacctcgtg gggagctccg ttcccgagga 900 

catttcaaaa aacagaatct ggcaaatttt tgagatgaat ggctacgacc cggcctacgc 960 

gggatccatc ctctacggct ggtgtcagcg ctccttcaac aagaggaaca ccgtctggct 1020 

ctacggaccc gccacgaccg gcaagaccaa catcgcggag gccatcgccc acactgtgcc 1080 

cttttacggc tgcgtgaact ggaccaatga aaactttccc tttaatgact gtgtggacaa 1140 

aatgctcatt tggtgggagg agggaaagat gaccaacaag gtggttgaat ccgccaaggc 1200 

catcctgggg ggctcaaagg tgcgggtcga tcagaaatgt aaatcctctg ttcaaattga 1260 

ttctacccct gtcattgtaa cttccaatac aaacatgtgt gtggtggtgg atgggaattc 1320 

cacgaccttt gaacaccagc agccgctgga ggaccgcatg ttcaaatttg aactgactaa 1380 

gcggctcccg ccagattttg gcaagattac taagcaggaa gtcaaggact tttttgcttg 1440 

ggcaaaggtc aatcaggtgc cggtgactca cgagtttaaa gttcccaggg aattggcggg 1500 

aactaaaggg gcggagaaat ctctaaaacg cccactgggt gacgtcacca atactagcta 1560 

taaaagtctg gagaagcggg ccaggctctc atttgttccc gagacgcctc gcagttcaga 1620 

cgtgactgtt gatcccgctc ctctgcgacc gctcaattgg aattcaagat tggttggaag 1680 
aagttggtga 



1690 



<210> 38 
<211> 145 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 38 

ccatcaccaa ggtaaagaag ggcggagcca ataaggtggt ggattctggg tatattcccg 60 

cctacctgct gccgaaggtc caaccggagc ttcagtgggc gtggacaaac ctggacgagt 120 

ataaattggc cgccctgaat ctgga 145 

<210> 39 
<211> 174 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence; nore = 
synthetic construct 

<400> 39 , , cn 

taagcaggaa gtcaaggact tttttgcttg ggcaaaggtc aatcaggtgc cggtgactca 60 

cgagtttaaa gttcccaggg aattggcggg aactaaaggg gcggagaaat ctctaaaacg 120 

cccactgggt gacgtcacca atactagcta taaaagtctg gagaagcggg ccag 1/4 

<210> 40 
<211> 187 
<212> DNA 

<213> Artificial Sequence 

<220> . - 

<223> Description of Artificial sequence; note = 

synthetic construct 

cactctcaag caagggggtt ttgtaagcag tgatgtcata atgatgtaat gcttattgtc 60 

acgcgatagt taatgattaa cagtcatgtg atgtgtttta tccaatagga agaaagcgcg 120 

cgtatgagtt ctcgcgagac ttccggggta taaaagaccg agtgaacgag cccgccgcca 180 

ttctttg Xu/ 



<210> 41 
<211> 168 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note » 
synthetic construct 

<400> 41 , , , n 

aaacctcctt gcttgagagt gtggcactct cccccctgtc gcgttcgctc gctcgctggc 60 

tcgtttgggg gggtggcagc tcaaagagct gccagacgac ggccctctgg ccgtcgcccc 12U 

cccaaacgag ccagcgagcg agcgaacgcg acagggggga gagtgcca loo 

<210> 42 
<211> 168 
<212> DNA 

<213> Artificial sequence 

<220> . , 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 42 CA 

aaacctcctt gcttgagagt gtggcactct cccccctgtc gcgttcgctc gctcgctggc 60 

tcgtttgggg gggcgacggc cagagggccg tcgtctgccg gctctttgag ctgccacccc i£U 

cccaaacgag ccagcgagcg agcgaacgcg acagggggga gagtgcca J-oo 

<210> 43 
<211> 8 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 



<400> 43 
cggtgtga 
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8 



21 



<210> 44 
<211> 8 
<212> DNA 

<213> Artificial Sequence 

<220> _ .,. . , 

<223> Description of Artificial Sequence; note = 

synthetic construct 

<400> 44 
cggttgag 

<210> 45 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 45 

caaaacctcc ttgcttgaga g 

<210> 46 

<211> 4675 

<212> DNA 

<213> Artificial Sequence 

<220> . , 

<223> Description of Artificial Sequence; note = 

synthetic construct 

<400> 46 , , n 

ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60 

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120 

gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 180 

ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 240 

gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 300 

ggtttgaacg cgcagccgcc atgccggggt tttacgagat tgtgattaag gtccccagcg 360 

accttgacgg gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg 420 

aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 480 

ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc 540 

cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 600 

tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg 660 

aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg 720 

tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 780 

ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 840 
agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 
cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 

cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1020 

aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1080 

atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1140 

tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1200 

ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 1260 

ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg l|20 

ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct 1380 

acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg 1440 

tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1500 

tcggaggaag caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga 1560 

ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1620 

ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1680 

tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 174U 
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aqqatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1800 

gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 1860 

agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat 1920 

gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga 1980 

atcagaattc aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg 2040 

tgtcagaatc tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc 2100 

atcatatcat gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt 2160 

tggatgactg catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat 2220 

cttceagatt ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa 2280 

cctggcccac caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg 2340 

cttcctgggt acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac 2400 

gaggcagacg ccgcggccct cgagcacgta caaagcctac gaccggcagc tcgacagcgg 2460 

agacaacccg tacctcaagt acaaccacgc cgacgcggag tttcaggagc gccttaaaga 2520 

agatacgtct tttgggggca acctcggacg agcagtcttc caggcgaaaa agagggttct 2580 

tgaacctctg ggcctggttg aggaacctgt taagacggct ccgggaaaaa agaggccggt 2640 

agagcactct cctgtggagc cagactcctc ctcgggaacc ggaaaggcgg gccagcagcc zzoo 

tgcaagaaaa agattgaatt ttggtcagac tggagacgca gactcagtac ctgaccccca 2760 

gcctctcgga cagccaccag cagccccctc tggtctggga actaatacga tggctacagg 2820 

cagtggcgca ccaatggcag acaataacga gggcgccgac ggagtgggta attcctccgg 2880 

aaattggcat tgcgattcca catggatggg cgacagagtc atcaccacca gcacccgaac 2940 

ctqqqccctg cccacctaca acaaccacct ctacaaacaa atttccagcc aatcaggagc 3000 

ctcgaacgac aatcactact ttggctacag caccccttgg gggtattttg acttcaacag 3060 

attccactgc cacttttcac cacgtgactg gcaaagactc atcaacaaca actggggatt 3120 

ccgacccaag agactcaact tcaagctctt taacattcaa gtcaaagagg tcacgcagaa 3180 

tgacggtacg acgacgattg ccaataacct taccagcacg gttcaggtgt ttactgactc 3240 

qqaqtaccag ctcccgtacg tcctcggctc ggcgcatcaa ggatgcctcc cgccgttccc 3300 

agcagacgtc ttcatggtgc cacagtatgg atacctcacc ctgaacaacg ggagtcaggc 3360 

agtaggacgc tcttcatttt actgcctgga gtactttcct tctcagatgc tgcgtaccgg 3420 

aaacaacttt accttcagct acacttttga ggacgttcct ttccacagca gctacgctca 3480 

cagccagagt ctggaccgtc tcatgaatcc tctcatcgac cagtacctgt attacttgag 3540 

caqaacaaac actccaagtg gaaccaccac gcagtcaagg cttcagtttt ctcaggccgg 3600 

agcgagtgac attcgggacc agtctaggaa ctggcttcct ggaccctgtt accgccagca 3660 

gcgagtatca aagacatctg cggataacaa caacagtgaa tactcgtgga ctggagctac 3720 

caagtaccac ctcaatggca gagactctct ggtgaatccg gccatggcaa gccacaagga 3780 

cgatgaagaa aagttttttc ctcagagcgg ggttctcatc tttgggaagc aaggctcaga 3840 

gaaaacaaat gtgaacattg aaaaggtcat gattacagac gaagaggaaa tcggaacaac 3900 

caatcccgtg gctacggagc agtatggttc tgtatctacc aacctccaga gaggcaacag 3960 

acaagcagct accgcagatg tcaacacaca aggcgttctt ccaggcatgg tctggcagga 4020 

cagagatgtg taccttcagg ggcccatctg ggcaaagatt ccacacacgg acggacattt 4080 
tcacccctct cccctcatgg gtggattcgg acttaaacac cctcctccac agattctcat 
caagaacacc ccggtacctg cgaatccttc gaccaccttc agtgcggcaa agtttgcttc 

cttcatcaca cagtactcca cgggacacgg tcagcgtgga gatcgagtgg gagctgcaga 4260 

aggaaaacag caaacgctgg aatcccgaaa ttcagtacac ttccaactac aacaagtctg 4320 

ttaatcgtgg acttaccgtg gatactaatg gcgtgtattc agagcctcgc cccattggca 4380 

ccagatacct gactcgtaat ctgtaattgc ttgttaatca ataaaccgtt taattcgttt 4440 

cagttgaact ttggtctctg cgtatttctt tcttatctag tttccatggc tacgtagata 4500 

agtagcatgg cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc 4560 

cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg 4620 

gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg gccaa 4b/ 5 

<210> 47 
<211> 4694 
<212> ONA 

<213> Artificial Sequence 
^2 2 0^ 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 47 a r\ 

qtggcactcc cccccctgtc gcgttcgctc gttcgctggc tcgattgggg gggtggcagc bo 

tcaaagagct gccagacgac ggccctctgg gccgtcgccc ccccaatcga Qccagcgaac l^u 

gagcgaacgc gacagggggg ggagtgccac actctctagc aagggggttt tgtaggtggt 180 
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gatgtcattg ttgatgtcat tatagttgtc acgcgatagt xaargaxtaa cagtcatgtg 240 

atgtgtgtta tccaatagga tgaaagcgcg cgaatgagat ctcgcgagac ttccggggta 300 

taaaaggggt gagtgaacga gcccgccgcc attctctgct ctggactgct agaggaccct 360 

cgctgccatg gctaccttct atgaagtcat tgttcgcgtt ccatttgatg tggaagagca 420 

cctgcctgga atttctgaca actttgtaga ctgggtaact ggtcaaattt gggagctgcc 480 

tcccgagtca gatttgaatt tgaCtctgat tgagcagcct cagctgacgg tggctgacag 540 

aattcgccgc gtgttcctgt acgagtggaa caaattttcc aagcaggaga gcaaattctt 600 

tgtgcagttt gaaaagggat ctgaatattt tcatctgcac acgctcgtgg agacctccgg 660 

catctcttct atggtccttg gccgctacgt gagtcagatt cgcgcccagc tggtgaaggt 720 

ggtgttccag aacattgagc cgcggattaa cgactgggtc gccatcacca aggtaaagaa 780 

gggcggagcc aataaggtgg tggattctgg gtatattccc gcctacctgc tgccgaaggt 840 

ccaaccagag cttcagtggg cgtggactaa cctcgaagag tataaattgg ccgccctcaa 900 

tctggaggag cgcaaacggc tcgtcgctca gtttcagctt gagtcctcgc agcgctcgca 960 

agaggcatct tcccagaggg acgtttcggc tgacccggtc atcaagagca agacttccca 1020 

gaaatacatg gcgctggtaa gctggctggt ggaacatggc atcacttccg agaagcagtg 1080 

gattcaggag aatcaggaga gctacctgtc cttcaactcc acgggaaact ctcggagcca 1140 

gattaaagcc gcgcttgaca acgcgtcaaa aattatgagt ctgaccaaat ctgcctcaga 1200 

ctatctcgtg ggacagactg ttccagagga catttctgaa aacagaatct ggcagatttt 1260 

tgatctcaac ggctacgacc cggcatacgc gggctctgtt ctctacggct ggtgcactcg 1320 

cgcctttgga aagaggaaca ccgtctggct gtatggaccc gcgaccaccg gaaagaccaa 1380 

catcgcggaa gccatctctc acaccgtgcc cttttatggc tgtgtgaact ggactaatga 1440 

gaactttccc tttaatgact gtgtggaaaa aatgttgatc tggtgggagg agggaaagat 1500 

gaccagcaag gtggtggaac ccgccaaggc catcttgggg gggtctagag tacgagtgga 1560 

tcaaaaatgt aaatcctctg tacaagtaga ctctaccccg gtgattatca cctccaatac 1620 

taacatgtgt gtggtggtgg atgggaactc cacgaccttt gaacaccagc agccgctgga 1680 

agaccgcatg ttcagatttg aactcatgcg gcggctcccg ccagattttg gcaagattac 1740 

caagcaggaa gtcaaagact tttttgcttg ggcaaaggtc aaccaggtgc cggtgactca 1800 

cgagtttatg gttcccaaga aagtggcggg aactgagagg gcggagactt ctagaaaacg 1860 

cccactggat gacgtcacca ataccaacta taaaagtccg gagaagcggg cccggctctc 1920 

agttgttcct gagacgcctc gcagttcaga cgtgcctgta gagcccgctc ctctgcgacc 1980 

tctcaactgg tcttccaggt atgaatgcag atgtgactat catgctaaat ttgactctgt 2040 

aacgggggaa tgtgacgagt gtgaatattt gaatcggggc aaaaatggct gtatctttca 2100 

taatgctaca cattgtcaaa tttgtcacgc tgttcctcca tgggaaaagg aaaatgtgtc 2160 

agattttaat gattttgatg actgtaataa agagcagtaa ataaagtgag tagtcatgtc 2220 

ttttgttgac caccctccag attggttgga atcgatcggc gacggctttc gtgaatttct 2280 

cggccttgag gcgggtcccc cgaaacccaa ggccaatcaa cagaagcaag ataacgctcg 2340 

aggtcttgtg cttcctgggt acaagtatct tggtcctggg aacggccttg ataagggcga 2400 

tcctgtcaat tttgctgacg aggttgcccg agagcacgac ctctcctacc agaaacagct 2460 

tgaggcgggc gataaccctt acctcaagta caaccacgcg gacgcagagt ttcaggagaa 2520 

actcgcttct gacacttctt ttgggggaaa ccttgggaag gctgttttcc aggctaaaaa 2580 

gaggattctc gaacctcttg gcctggttga gacgccggat aaaacggcgc ctgcggcaaa 2640 

aaagaggcct ctagagcaga gtcctcaaga gccagactcc tcgagcggag ttggcaagaa 2700 

aggcaaacag cctgccagaa agagactcaa ctttgacgac gaacctggag ccggagacgg 2760 

gcctccccca gaaggaccat cttccggagc tatgtctact gagactgaaa tgcgtgcagc 2820 

agctggcgga aatggtggcg atgcgggaca aggtgccgag ggagtgggta atgcctccgg 2880 

tgattggcat tgcgattcca cttggtcaga gagccacgtc accaccacct caacccgcac 2940 

ctgggtcctg ccgacctaca acaaccacct gtacctgcgg ctcggctcga gcaacgccag 3000 

cgacaccttc aacggattct ccaccccctg gggatacttt gactttaacc gcttccactg 3060 

ccacttctcg ccaagagact ggcaaaggct catcaacaac cactggggac tgcgccccaa 3120 

aagcatgcaa gtccgcatct tcaacatcca agttaaggag gtcacgacgt ctaacgggga 3180 

gacgaccgta tccaacaacc tcaccagcac ggtccagatc tttgcggaca gcacgtacga 3240 

gctcccgtac gtgatggatg caggtcagga gggcagcttg cctcctttcc ccaacgacgt 3300 

gttcatggtg cctcagtacg ggtactgcgg actggtaacc ggaggcagct ctcaaaacca 3360 

gacagacaga aatgccttct actgtctgga gtactttccc agccagatgc tgagaaccgg 3420 

aaacaacttt gagatggtgt acaagtttga aaacgtgccc ttccactcca tgtacgctca 3480 

cagccagagc ctggataggc tgatgaaccc gctgctggac cagtacctgt gggagctcca 3540 

gtctaccacc tctggaggaa ctctcaacca gggcaattca gccaccaact ttgccaagct 3600 

gaccaaaaca aacttttctg gctaccgcaa aaactggctc ccggggccca tgatgaagca 3660 

gcagagattc tccaagactg ccagtcaaaa ctacaagatt ccccagggaa gaaacaacag 3720 

tctgctccat tatgagacca gaactaccct cgacggaaga tggagcaatt ttgccccggg 3780 

aacggccatg gcaaccgcag ccaacgacgc caccgacttc tctcaggccc agctcatctt 3840 

tgcggggccc aacatcaccg gcaacaccac cacagatgcc aataacctga tgttcacttc 3900 

agaagatgaa cttagggcca ccaacccccg ggacactgac ctgtttggcc acctggcaac 3960 
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caaccagcaa aacgccacca ccgttcctac cgiagacgac gxggacggag tcggcgtgta 4020 

cccqqgaatg gtgtggcagg acagagacat ttactaccaa gggcccattt gggccaaaat 4080 

tccacacacg gatggacact ttcacccgtc tcctctcatt ggcggatttg gactgaaaag 4140 

cccqcctcca caaatattca tcaaaaacac tcctgtaccc gccaatcccg caacgacctt 4200 

ctctccggcc agaatcaaca gcttcatcac ccagtacagc accggacagg tggctgtcaa 4260 

aatagaatgg gaaatccaga aggagcggtc caagagatgg aacccagagg tccagttcac 4320 

gtccaactac ggagcacagg actcgcttct ctgggctccc gacaacgccg gagcctacaa 4380 

aqaqcccagg gccattggat cccgatacct caccaaccac ctctagccca attctgttgc 4440 

ataccctcaa taaaccgtgt attcgtttca gtaaaatact gcctcttgtg gtcattcggc 4500 

qtacaacagc ttacaacaac aacaaaaccc ccttgctaga gagtgtggca ctcccccccc 4560 

tqtcgcgttc gctcgttcgc tggctcgatt gggggggtgg cagctcaaag agctgccaga 4620 

cgacggccct ctgggccgtc gcccccccaa tcgagccagc gaacgagcga acgcgacagg 4680 

ggggggagtg ccac 4by4 

<210> 48 
<211> 1833 
<212> DNA 

<213> Artificial sequence 

<220> . . „ 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 48 , + , n 

atggctacct tctatgaagt cattgttcgc gttccatttg atgtggaaga gcacctgcct 60 

ggaatttctg acaactttgt agactgggta actggtcaaa tttgggagct gcctcccgag 120 

tcagatttga atttgactct gattgagcag cctcagctga cggtggctga cagaattcgc 180 

cgcgtgttcc tgtacgagtg gaacaaattt tccaagcagg agagcaaatt ctttgtgcag 240 

tttgaaaagg gatctgaata ttttcatctg cacacgctcg tggagacctc cggcatctct 300 

tctatggtcc ttggccgcta cgtgagtcag attcgcgccc agctggtgaa ggtggtgttc 360 

cagaacattg agccgcggat taacgactgg gtcgccatca ccaaggtaaa gaagggcgga 420 

gccaataagg tggtggattc tgggtatatt cccgcctacc tgctgccgaa ggtccaacca 480 

gagcttcagt gggcgtggac taacctcgaa gagtataaat tggccgccct caatctggag 540 

gagcgcaaac ggctcgtcgc tcagtttcag cttgagtcct cgcagcgctc gcaagaggca 600 

tcttcccaga gggacgtttc ggctgacccg gtcatcaaga gcaagacttc ccagaaatac 660 

atggcgctgg taagctggct ggtggaacat ggcatcactt ccgagaagca gtggattcag 720 

gagaatcagg agagctacct gtccttcaac tccacgggaa actctcggag ccagattaaa 780 

gccgcgcttg acaacgcgtc aaaaattatg agtctgacca aatctgcctc agactatctc 840 
gtgggacaga ctgttccaga ggacatttct gaaaacagaa tctggcagat ttttgatctc 
aacggctacg acccggcata cgcgggctct gttctctacg gctggtgcac tcgcgccttt 

ggaaagagga acaccgtctg gctgtatgga cccgcgacca ccggaaagac caacatcgcg 1020 

gaagccatct ctcacaccgt gcccttttat ggctgtgtga actggactaa tgagaacttt 1080 

ccctttaatg actgtgtgga aaaaatgttg atctggtggg aggagggaaa gatgaccagc 1140 

aaggtggtgg aacccgccaa ggccatcttg ggggggtcta gagtacgagt ggatcaaaaa 1200 

tgtaaatcct ctgtacaagt agactctacc ccggtgatta tcacctccaa tactaacatg 1260 

tgtgtggtgg tggatgggaa ctccacgacc tttgaacacc agcagccgct ggaagaccgc 1320 

atgttcagat ttgaactcat gcggcggctc ccgccagatt ttggcaagat taccaagcag 1380 

gaagtcaaag acttttttgc ttgggcaaag gtcaaccagg tgccggtgac tcacgagttt 1440 

atggttccca agaaagtggc gggaactgag agggcggaga cttctagaaa acgcccactg 1500 

gatgacgtca ccaataccaa ctataaaagt ccggagaagc gggcccggct ctcagttgtt 1560 

cctgagacgc ctcgcagttc agacgtgcct gtagagcccg ctcctctgcg acctctcaac 1620 

tggtcttcca ggtatgaatg cagatgtgac tatcatgcta aatttgactc tgtaacgggg 1680 

gaatgtgacg agtgtgaata tttgaatcgg ggcaaaaatg gctgtatctt tcataatgct 1740 

acacattgtc aaatttgtca cgctgttcct ccatgggaaa aggaaaatgt gtcagatttt 1800 

aatgattttg atgactgtaa taaagagcag taa i"3 

<210> 49 
<211> 610 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
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synthetic construct 
<400> 49 

Met Ala Thr Phe Tyr Glu val He val Arg val Pro Phe Asp Val Glu 

1 5 10 15 

Glu His Leu Pro Gly lie Ser Asp Asn Phe val Asp Trp val Thr Gly 

20 25 30 

Gin lie Trp Glu Leu Pro Pro Glu ser Asp Leu Asn Leu Thr Leu He 

35 40 45 

Glu Gin Pro Gin Leu Thr Val Ala Asp Arg He Arg Arg Val Phe Leu 

50 55 60 

Tyr Glu Trp Asn Lys Phe ser Lys Gin Glu ser Lys Phe Phe Val Gin 
65 70 75 80 

Phe Glu Lys Gly Ser Glu Tyr Phe His Leu His Thr Leu Val Glu Thr 

85 90 95 

ser Gly He ser ser Met val Leu Gly Arg Tyr val ser Gin He Arg 

100 105 HO 

Ala Gin Leu Val Lys val Val Phe Gin Asn lie Glu Pro Arg He Asn 

115 120 125 

asp Trp Val Ala He Thr Lys Val Lys Lys Gly Gly Ala Asn Lys Val 

130 135 140 

val Asp Ser Gly Tyr lie Pro Ala Tyr Leu Leu Pro Lys Val Gin Pro 
145 * 150 155 160 

Glu Leu Gin Trp Ala Trp Thr Asn Leu Glu Glu Tyr Lys Leu Ala Ala 

165 170 175 

Leu Asn Leu Glu Glu Arg Lys Arg Leu Val Ala Gin Phe Gin Leu Glu 

180 185 190 

Ser Ser Gin Arg ser Gin Glu Ala Ser ser Gin Arg Asp val ser Ala 

195 200 205 

Asp Pro val He Lys Ser Lys Thr ser Gin Lys Tyr Met Ala Leu Val 

210 215 220 

ser Trp Leu Val Glu His Gly He Thr Ser Glu Lys Gin Trp lie Gin 
225 230 235 240 

Glu Asn Gin Glu Ser Tyr Leu ser Phe Asn Ser Thr Gly Asn Ser Arg 

245 250 255 

Ser Gin lie Lys Ala Ala Leu Asp Asn Ala Ser Lys lie Met Ser Leu 

260 265 270 

Thr Lys Ser Ala Ser Asp Tyr Leu val Gly Gin Thr val Pro Glu Asp 

275 280 285 

He ser Glu Asn Arg He Trp Gin lie Phe Asp Leu Asn Gly Tyr Asp 

290 ~ 295 300 

Pro Ala Tyr Ala Gly ser val Leu Tyr Gly Trp cys Thr Arg Ala Phe 
305 310 315 320 

Gly Lys Arg Asn Thr val Trp Leu Tyr Gly Pro Ala Thr Thr Gly Lys 

325 330 335 

Thr Asn He Ala Glu Ala He ser His Thr val Pro Phe Tyr Gly cys 

340 345 350 

val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp Cys Val Glu Lys 

355 360 365 

Met Leu He Trp Trp Glu Glu Gly Lys Met Thr Ser Lys Val val Glu 

370 375 380 

Pro Ala Lys Ala He Leu Gly Gly ser Arg val Arg Val Asp Gin Lys 
385 390 395 400 

cys Lys Ser Ser val Gin val Asp ser Thr Pro val He He Thr ser 

y ' 405 410 415 

Asn Thr Asn Met cys val Val Val Asp Gly Asn Ser Thr Thr Phe Glu 

420 425 430 

His Gin Gin Pro Leu Glu Asp Arg Met Phe Arg Phe Glu Leu Met Arg 

435 440 445 

Arg Leu Pro Pro Asp phe Gly Lys lie Thr Lys Gin Glu val Lys Asp 

450 455 " 460 

Phe Phe Ala Trp Ala Lys val Asn Gin val Pro val Thr His Glu Phe 
465 470 475 480 
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Met val Pro Lys Lys Val Ala Gly Thr Glu Arg Ala Glu Thr Ser Arg 

485 490 495 

Lys Arg Pro Leu Asp Asp Val Thr Asn Thr Asn Tyr Lys Ser Pro Glu 

500 505 510 

Lys Arg Ala Arg Leu ser val val Pro Glu Thr Pro Arg Ser ser Asp 

515 520 525 

Val Pro val Glu Pro Ala Pro Leu Arg Pro Leu Asn Trp Ser Ser Arg 

530 535 540 

Tyr Glu Cys Arg cys Asp Tyr His Ala Lys Phe Asp Ser Val Thr Gly 
545 ~ 550 555 560 

Glu cys Asp Glu Cys Glu Tyr Leu Asn Arg Gly Lys Asn Gly cys He 

565 570 575 

Phe His Asn Ala Thr His cys Gin lie cys His Ala Val Pro Pro Trp 

580 585 590 

Glu Lys Glu Asn val Ser Asp Phe Asn Asp Phe Asp Asp cys Asn Lys 
595 600 605 

Glu Gin 
610 

<210> 50 
<211> 1173 
<212> DNA 

<213> Artificial sequence 

<220> . , 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 50 ^ _ cn 

atggcgctgg taagctggct ggtggaacat ggcatcactt ccgagaagca gtggattcag 60 

gagaatcagg agagctacct gtccttcaac tccacgggaa actctcggag ccagattaaa 120 

gccgcgcttg acaacgcgtc aaaaattatg agtctgacca aatctgcctc agactatctc 180 

qtgggacaga ctgttccaga ggacatttct gaaaacagaa tctggcagat ttttgatctc 240 

aacggctacg acccggcata cgcgggctct gttctctacg gctggtgcac tcgcgccttt 300 

ggaaagagga acaccgtctg gctgtatgga cccgcgacca ccggaaagac caacatcgcg 360 

gaagccatct ctcacaccgt gcccttttat ggctgtgtga actggactaa tgagaacttt 420 

ccctttaatg actgtgtgga aaaaatgttg atctggtggg aggagggaaa gatgaccagc 450 

aaggtggtgg aacccgccaa ggccatcttg ggggggtcta gagtacgagt ggatcaaaaa b40 

tgtaaatcct ctgtacaagt agactctacc ccggtgatta tcacctccaa tactaacatg 600 

tgtgtggtgg tggatgggaa ctccacgacc tttgaacacc agcagccgct ggaagaccgc 660 

atgttcagat ttgaactcat gcggcggctc ccgccagatt ttggcaagat taccaagcag 7ZO 

gaagtcaaag acttttttgc ttgggcaaag gtcaaccagg tgccggtgac tcacgagttt 780 

atggttccca agaaagtggc gggaactgag agggcggaga cttctagaaa acgcccactg 840 

gatgacgtca ccaataccaa ctataaaagt ccggagaagc gggcccggct ctcagttgtt 900 

cctgagacgc ctcgcagttc agacgtgcct gtagagcccg ctcctctgcg acctctcaac 960 

tggtcttcca ggtatgaatg cagatgtgac tatcatgcta aatttgactc tgtaacgggg 1020 

gaatgtgacg agtgtgaata tttgaatcgg ggcaaaaatg gctgtatctt tcataatgct 1080 

acacattgtc aaatttgtca cgctgttcct ccatgggaaa aggaaaatgt gtcagatttt 1140 

aatgattttg atgactgtaa taaagagcag taa 117 * 

<210> 51 
<211> 390 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 51 , 

Met Ala Leu Val Ser Trp Leu Val Glu His Gly lie Thr ser Glu Lys 
Gin Trp lie Gin Glu Asn Gin Glu Ser Tyr Leu ser Phe Asn ser Thr 
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20 25 30 

Gly Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala Ser Lys 

35 40 45 

lie Met Ser Leu Thr Lys ser Ala ser Asp Tyr Leu val Gly Gin Thr 

50 55 60 

val Pro Glu Asp He ser Glu Asn Arg lie Trp Gin lie Phe Asp Leu 
65 70 75 80 

Asn Gly Tyr Asp Pro Ala Tyr Ala Gly Ser Val Leu Tyr Gly Trp cys 

85 90 95 

Thr Arg Ala Phe Gly Lys Arg Asn Thr Val Trp Leu Tyr Gly Pro Ala 

100 " 105 110 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala lie ser His Thr val pro 

115 120 125 

Phe Tyr Gly cys val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

130 135 140 

Cvs Val Glu Lys Met Leu lie Trp Trp Glu Glu Gly Lys Met Thr ser 
145 150 155 160 

Lys val val Glu Pro Ala Lys Ala lie Leu Gly Gly Ser Arg val Arg 

165 170 175 

val Asp Gin Lys cys Lys Ser Ser Val Gin val Asp ser Thr Pro Val 

180 185 1?0 

He He Thr Ser Asn Thr Asn Met Cys Val Val val Asp Gly Asn ser 

195 200 205 

Thr Thr Phe Glu His Gin Gin Pro Leu Glu Asp Arg Met Phe Arg Phe 

210 215 220 

Glu Leu Met Arg Arg Leu Pro Pro Asp Phe Gly Lys lie Thr Lys Gin 
225 230 235 240 

Glu val Lys Asp Phe Phe Ala Trp Ala Lys val Asn Gin val Pro val 

245 250 255 

Thr His Glu Phe Met Val Pro Lys Lys val Ala Gly Thr Glu Arg Ala 

260 265 270 

Glu Thr ser Arg Lys Arg Pro Leu Asp Asp val Thr Asn Thr Asn Tyr 

275 " 280 285 

Lvs Ser Pro Glu Lys Arg Ala Arg Leu Ser Val val Pro Glu Thr Pro 

290 295 300 

Arg Ser ser Asp val Pro Val Glu Pro Ala Pro Leu Arg Pro Leu Asn 
305 310 315 320 

Trp Ser Ser Arg Tyr Glu Cys Arg Cys Asp Tyr His Ala Lys Phe Asp 

325 330 335 

Ser val Thr Gly Glu cys Asp Glu Cys Glu Tyr Leu Asn Arg Gly Lys 

340 345 350 

Asn Gly Cys He Phe His Asn Ala Thr His Cys Gin lie cys His Ala 

355 360 365 

val Pro Pro Trp Glu Lys Glu Asn val ser Asp Phe Asn Asp Phe Asp 

370 375 380 

Asp Cys Asn Lys Glu Gin 
385 390 

<210> 52 
<211> 2211 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 52 ^ ^ cn 

atgtcttttg ttgaccaccc tccagattgg ttggaatcga tcggcgacgg ctttcgtgaa 60 

tttctcggcc ttgaggcggg tcccccgaaa cccaaggcca atcaacagaa gcaagataac izu 

gctcgaggtc ttgtgcttcc tgggtacaag tatcttggtc ctgggaacgg ccttgataag 180 

ggcgatcctg tcaattttgc tgacgaggtt gcccgagagc acgacctctc ctaccagaaa 240 

cagcttgagg cgggcgataa cccttacctc aagtacaacc acgcggacgc agagtttcag 300 
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900 
960 



gagaaactcg cttctgacac ttcttttggg ggaaaccttg ggaaggctgt tttccaggct 360 

aaaaagagga ttctcgaacc tcttggcctg gttgagacgc cggataaaac ggcgcctgcg 420 

gcaaaaaaga ggcctctaga gcagagtcct caagagccag actcctcgag cggagttggc 480 

aagaaaggca aacagcctgc cagaaagaga ctcaactttg acgacgaacc tggagccgga 540 

gacgggcctc ccccagaagg accatcttcc ggagctatgt ctactgagac tgaaatgcgt 600 

gcagcagctg gcggaaatgg tggcgatgcg ggacaaggtg ccgagggagt gggtaatgcc 660 

tccggtgatt ggcattgcga ttccacttgg tcagagagcc acgtcaccac cacctcaacc 720 

cgcacctggg tcctgccgac ctacaacaac cacctgtacc tgcggctcgg ctcgagcaac 780 

gccagcgaca ccttcaacgg attctccacc ccctggggat actttgactt taaccgcttc 840 
cactgccact tctcgccaag agactggcaa aggctcatca acaaccactg gggactgcgc 
cccaaaagca tgcaagtccg catcttcaac atccaagtta aggaggtcac gacgtctaac 

ggggagacga ccgtatccaa caacctcacc agcacggtcc agatctttgc ggacagcacg 1020 

tacgagctcc cgtacgtgat ggatgcaggt caggagggca gcttgcctcc tttccccaac 1080 

gacgtgttca tggtgcctca gtacgggtac tgcggactgg taaccggagg cagctctcaa 1140 

aaccagacag acagaaatgc cttctactgt ctggagtact ttcccagcca gatgctgaga 1200 

accggaaaca actttgagat ggtgtacaag tttgaaaacg tgcccttcca ctccatgtac 1260 

gctcacagcc agagcctgga taggctgatg aacccgctgc tggaccagta cctgtgggag 1320 

ctccagtcta ccacctctgg aggaactctc aaccagggca attcagccac caactttgcc 1380 

aagctgacca aaacaaactt ttctggctac cgcaaaaact ggctcccggg gcccatgatg 1440 

aagcagcaga gattctccaa gactgccagt caaaactaca agattcccca gggaagaaac 1500 

aacagtctgc tccattatga gaccagaact accctcgacg gaagatggag caattttgcc 1560 

ccgggaacgg ccatggcaac cgcagccaac gacgccaccg acttctctca ggcccagctc 1620 

atctttgcgg ggcccaacat caccggcaac accaccacag atgccaataa cctgatgttc 1680 

acttcagaig atgaacttag ggccaccaac ccccgggaca ctgacctgtt tggccacctg 1740 

gcaaccaacc agcaaaacgc caccaccgtt cctaccgtag acgacgtgga cggagtcggc 1800 

atatacccgg gaatggtgtg gcaggacaga gacatttact accaagggcc catttgggcc 1860 

aaiattccac Icacggatgg acactttcac ccgtctcctc tcattggcgg atttggactg 1920 

aaaagcccgc ctccacaaat attcatcaaa aacactcctg tacccgccaa tcccgcaacg 1980 

accttctctc cggccagaat caacagcttc atcacccagt acagcaccgg acaggtggct 2040 

gtcaaaatag aatgggaaat ccagaaggag cggtccaaga gatggaaccc agaggtccag 2100 

ttcacgtcca actacggagc acaggactcg cttctctggg ctcccgacaa cgccggagcc 2160 

tacaaagagc ccagggccat tggatcccga tacctcacca accacctcta g 2^i± 

<210> 53 
<211> 736 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 53 _ — ^ . 

Met ser Phe val Asp His pro Pro Asp Trp Leu Glu Ser He Gly Asp 
1 5 10 15 

Gly Phe Arg Glu Phe Leu Gly Leu Glu Ala Gly Pro Pro Lys Pro Lys 

20 25 30 

Ala Asn Gin Gin Lys Gin Asp Asn Ala Arg Gly Leu val Leu Pro Gly 

35 40 45 

Tvr Lvs Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Asp Pro val 

50 55 60 

Asn Phe Ala Asp Glu val Ala Arg Glu His Asp Leu Ser Tyr Gin Lys 
65 70 75 80 

Gin Leu Glu Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala Asp 

85 90 95 

Ala Glu Phe Gin Glu Lys Leu Ala Ser Asp Thr Ser Phe Gly Gly Asn 

100 105 110 

Leu Gly Lys Ala val Phe Gin Ala Lys Lys Arg lie Leu Glu Pro Leu 

115 120 , 1?5 

Gly Leu val Glu Thr Pro Asp Lys Thr Ala Pro Ala Ala Lys Lys Arg 

130 135 I 40 

pro Leu Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly val Gly 
145 150 155 160 
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lvs Lys Gly Lys Gin Pro Ala Arg Lys Arg Leu Asn Phe Asp Asp Glu 
3 165 170 175 

Pro Gly Ala Gly Asp Gly Pro Pro Pro Glu Gly Pro Ser ser Gly Ala 

Met ser Thr Glu Thr Glu Met Arg Ala Ala Ala Gly Gly Asn Gly Gly 

195 200 205 

asd Ala Gly Gin Gly Ala Glu Gly val Gly Asn Ala Ser Gly Asp Trp 

210 215 220 

His cys Asp Ser Thr Trp Ser Glu ser His val Thr Thr Thr Ser Thr 
225 230 235 240 

Arq Thr Trp val Leu Pro Thr Tyr Asn Asn His Leu Tyr Leu Arg Leu 

245 250 255 

Gly ser Ser Asn Ala Ser Asp Thr Phe Asn Gly Phe Ser Thr Pro Trp 

y 260 265 270 

Glv Tyr Phe Asp Phe Asn Arg Phe His Cys His Phe ser Pro Arg Asp 

y 275 280 285 

Trp Gin Arg Leu lie Asn Asn His Trp Gly Leu Arg Pro Lys ser Met 

290 295 300 

Gin Val Arg lie Phe Asn He Gin val Lys Glu val Thr Thr Ser Asn 
305 310 315 320 

Glv Glu Thr Thr Val ser Asn Asn Leu Thr Ser Thr val Gin lie Phe 

325 330 335 

Ala Asp Ser Thr Tyr Glu Leu Pro Tyr Val Met Asp Ala Gly Gin Glu 

340 345 350 

Glv Ser Leu Pro Pro Phe Pro Asn Asp Val Phe Met val Pro Gin Tyr 

355 360 365 

Glv Tyr cys Gly Leu val Thr Gly Gly ser Ser Gin Asn Gin Thr Asp 

370 375 380 

Arg Asn Ala Phe Tyr cys Leu Glu Tyr Phe Pro Ser Gin Met Leu Arg 
385 390 395 400 

Thr Gly Asn Asn Phe Glu Met val Tyr Lys Phe Glu Asn val Pro Phe 

405 410 415 

His ser Met Tyr Ala His ser Gin Ser Leu Asp Arg Leu Met Asn Pro 

420 425 430 

Leu Leu Asp Gin Tyr Leu Trp Glu Leu Gin Ser Thr Thr Ser Gly Gly 

435 440 445 

Thr Leu Asn Gin Gly Asn ser Ala Thr Asn Phe Ala Lys Leu Thr Lys 

450 455 460 

Thr Asn Phe ser Gly Tyr Arg Lys Asn Trp Leu Pro Gly Pro Met Met 
465 470 ~ 475 480 

Lys Gin Gin Arg Phe ser Lys Thr Ala ser Gin Asn Tyr Lys lie Pro 

485 490 495 

Gin Gly Arg Asn Asn ser Leu Leu His Tyr Glu Thr Arg Thr Thr Leu 

500 505 510 

asd Gly Arg Trp Ser Asn Phe Ala Pro Gly Thr Ala Met Ala Thr Ala 

515 520 525 

Ala Asn Asp Ala Thr Asp Phe ser Gin Ala Gin Leu lie Phe Ala Gly 

530 * 535 540 

Pro Asn lie Thr Gly Asn Thr Thr Thr Asp Ala Asn Asn Leu Met Phe 
545 550 555 560 

Thr ser Glu Asp Glu Leu Arg Ala Thr Asn Pro Arg Asp Thr Asp Leu 

565 570 575 

Phe Gly His Leu Ala Thr Asn Gin Gin Asn Ala Thr Thr Val Pro Thr 

580 585 590 

val Asp Asp val Asp Gly Val Gly val Tyr Pro Gly Met Val Trp Gin 

595 600 n 605 

Asp Arg Asp lie Tyr Tyr Gin Gly Pro lie Trp Ala Lys lie Pro His 

610 ' 615 620 

Thr Asp Gly His Phe His Pro ser Pro Leu lie Gly Gly Phe Gly Leu 
625 630 635 640 

Lys Ser Pro Pro Pro Gin lie Phe lie Lys Asn Thr Pro val Pro Ala 

645 650 655 

Asn Pro Ala Thr Thr phe ser Pro Ala Arg lie Asn Ser Phe lie Thr 
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660 665 ^ 670 

Gin Tyr Ser Thr Gly Gin Val Ala val .Lys lie Glu Trp Glu lie Gin 

675 680 685 

Lys Glu Arg Ser Lys Arg Trp Asn Pro Glu Val Gin Phe Thr Ser Asn 

690 695 700 

Tvr Gly Ala Gin Asp Ser Leu Leu Trp Ala Pro Asp Asn Ala Gly Ala 
705 710 715 . 720 

Tvr Lys Glu Pro Arg Ala lie Gly ser Arg Tyr Leu Thr Asn His Leu 
y y 725 730 735 

<210> 54 
<211> 1803 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 54 _ ^ „ 

acggcgcctg cggcaaaaaa gaggcctcta gagcagagtc ctcaagagcc agactcctcg 60 
agcggagttg gcaagaaagg caaacagcct gccagaaaga gactcaactt tgacgacgaa 120 
cctggagccg gagacgggcc tcccccagaa ggaccatctt ccggagctat gtctactgag 180 
actgaaatgc gtgcagcagc tggcggaaat ggtggcgatg cgggacaagg tgccgaggga 240 
gtgggtaatg cctccggtga ttggcattgc gattccactt ggtcagagag ccacgtcacc 300 
accacctcaa cccgcacctg ggtcctgccg acctacaaca accacctgta cctgcggctc 360 
ggctcgagca acgccagcga caccttcaac ggattctcca ccccctgggg atactttgac 420 
tttaaccgct tccactgcca cttctcgcca agagactggc aaaggctcat caacaaccac 480 
tggggactgc gccccaaaag catgcaagtc cgcatcttca acatccaagt taaggaggtc 540 
acgacgtcta acggggagac gaccgtatcc aacaacctca ccagcacggt ccagatcttt 600 
gcggacagca cgtacgagct cccgtacgtg atggatgcag gtcaggaggg cagcttgcct 660 
cctttcccca acgacgtgtt catggtgcct cagtacgggt actgcggact ggtaaccgga 720 
ggcagctctc aaaaccagac agacagaaat gccttctact gtctggagta ctttcccagc 780 
cagatgctga gaaccggaaa caactttgag atggtgtaca agtttgaaaa cgtgcccttc 
cactccatgt acgctcacag ccagagcctg gataggctga tgaacccgct gctggaccag 
tacctgtggg agctccagtc taccacctct ggaggaactc tcaaccaggg caattcagcc 
accaactttg ccaagctgac caaaacaaac ttttctggct accgcaaaaa ctggctcccg 
gggcccatga tgaagcagca gagattctcc aagactgcca gtcaaaacta caagattccc 
cagggaagaa acaacagtct gctccattat gagaccagaa ctaccctcga cggaagatgg 1140 
agcaattttg ccccgggaac ggccatggca accgcagcca acgacgccac cgacttctct 1200 
caggcccagc tcatctttgc ggggcccaac atcaccggca acaccaccac agatgccaat 1260 
aacctgatgt tcacttcaga agatgaactt agggccacca acccccggga cactgacctg 1320 
tttggccacc tggcaaccaa ccagcaaaac gccaccaccg ttcctaccgt agacgacgtg 1380 
gacggagtcg gcgtgtaccc gggaatggtg tggcaggaca gagacattta ctaccaaggg 1440 
cccatttggg ccaaaattcc acacacggat ggacactttc acccgtctcc tctcattggc 1500 
ggatttggac tgaaaagccc gcctccacaa atattcatca aaaacactcc tgtacccgcc 1560 
aatcccgcaa cgaccttctc tccggccaga atcaacagct tcatcaccca gtacagcacc 1620 
ggacaggtgg ctgtcaaaat agaatgggaa atccagaagg agcggtccaa gagatggaac 1680 
ccagaggtcc agttcacgtc caactacgga gcacaggact cgcttctctg ggctcccgac 1740 
aacgccggag cctacaaaga gcccagggcc attggatccc gatacctcac caaccacctc 
tag 

<210> 55 
<211> 600 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 55 „ , - 

Thr Ala Pro Ala Ala Lys Lys Arg Pro Leu Glu Gin Ser Pro Gin Glu 



840 
900 
960 
1020 
1080 



1800 
1803 



49/60 



WO 2006/029196 PCT/US2005/031837 



15 10 15 

Pro Asp Ser Ser ser Gly val Gly Lys Lys Gly Lys Gin Pro Ala Arg 

20 25 30 

Lys Arg Leu Asn Phe Asp Asp Glu Pro Gly Ala Gly Asp Gly Pro Pro 

35 40 45 

pro Glu Gly Pro Ser ser Gly Ala Met Ser Thr Glu Thr Glu Met Arg 

50 55 60 

Ala Ala Ala Gly Gly Asn Gly Gly Asp Ala Gly Gin Gly Ala Glu Gly 
65 70 75 80 

Val Gly Asn Ala Ser Gly Asp Trp His cys Asp Ser Thr Trp Ser Glu 

85 90 95 

Ser His val Thr Thr Thr ser Thr Arg Thr Trp Val Leu Pro Thr Tyr 

100 105 . 110 

Asn Asn His Leu Tyr Leu Arg Leu Gly ser ser Asn Ala Ser Asp Thr 

115 120 125 

Phe Asn Gly Phe Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 

130 135 140 

His cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn His 
145 150 " 155 160 

Trp Gly Leu Arg Pro Lys Ser Met Gin Val Arg lie Phe Asn lie Gin 

165 170 175 

Val Lys Glu Val Thr Thr Ser Asn Gly Glu Thr Thr val Ser Asn Asn 

180 185 190 

Leu Thr Ser Thr Val Gin lie Phe Ala Asp ser Thr Tyr Glu Leu Pro 

195 200 205 

Tyr Val Met Asp Ala Gly Gin Glu Gly Ser Leu Pro Pro Phe Pro Asn 

210 215 220 

Asp val Phe Met Val Pro Gin Tyr Gly Tyr Cys Gly Leu Val Thr Gly 
225 230 235 240 

Gly ser ser Gin Asn Gin Thr Asp Arg Asn Ala Phe Tyr cys Leu Glu 

245 250 255 

Tvr Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Met val 

260 265 270 

Tyr Lys Phe Glu Asn val Pro Phe His ser Met Tyr Ala His ser Gin 

275 280 285 

ser Leu Asp Arg Leu Met Asn Pro Leu Leu Asp Gin Tyr Leu Trp Glu 

290 295 300 

Leu Gin Ser Thr Thr Ser Gly Gly Thr Leu Asn Gin Gly Asn Ser Ala 
305 310 315 320 

Thr Asn Phe Ala Lys Leu Thr Lys Thr Asn Phe ser Gly Tyr Arg Lys 

325 330 335 

Asn Trp Leu Pro Gly Pro Met Met Lys Gin Gin Arg Phe ser Lys Thr 

340 345 350 

Ala Ser Gin Asn Tyr Lys lie Pro Gin Gly Arg Asn Asn Ser Leu Leu 

355 360 365 

His Tyr Glu Thr Arg Thr Thr Leu Asp Gly Arg Trp Ser Asn Phe Ala 

370 ~ 375 380 

Pro Gly Thr Ala Met Ala Thr Ala Ala Asn Asp Ala Thr Asp Phe Ser 
385 390 395 400 

Gin Ala Gin Leu He Phe Ala Gly Pro Asn He Thr Gly Asn Thr Thr 

405 410 n 415 n 

Thr Asp Ala Asn Asn Leu Met Phe Thr Ser Glu Asp Glu Leu Arg Ala 

420 425 430 

Thr Asn Pro Arg Asp Thr Asp Leu Phe Gly His Leu Ala Thr Asn Gin 

435 440 445 i 

Gin Asn Ala Thr Thr val Pro Thr val Asp Asp Val Asp Gly val Gly 

450 455 460 

val Tyr Pro Gly Met val Trp Gin Asp Arg Asp lie Tyr Tyr Gin Gly 
465 470 475 480 

Pro lie Trp Ala Lys lie Pro His Thr Asp Gly His Phe His Pro ser 

485 490 495 

Pro Leu lie Gly Gly Phe Gly Leu Lys ser Pro Pro Pro Gin lie Phe 

500 505 510 
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lie Lys Asn Thr Pro val Pro Ala Asn Pro Ala Thr Thr Phe Ser Pro 

515 520 525 

Ala Arg lie Asn Ser Phe lie Thr Gin Tyr Ser Thr Gly Gin Val Ala 

530 535 540 

Val Lys lie Glu Trp Glu lie Gin Lys Glu Arg ser Lys Arg Trp Asn 
545 550 555 560 

Pro Glu val Gin Phe Thr Ser Asn Tyr Gly Ala Gin Asp ser Leu Leu 

565 570 575 

Tro Ala Pro Asp Asn Ala Gly Ala Tyr Lys Glu Pro Arg Ala He Gly 

580 585 590 

ser Arg Tyr Leu Thr Asn His Leu 
595 600 

<210> 56 
<211> 1617 
<212> DNA 

<213> Artificial sequence 
^ 2 2 0^ 

<223> Description of Artificial sequence; note = 
synthetic construct 

atgcgtgcag cagctggcgg aaatggtggc gatgcgggac aaggtgccga gggagtgggt 60 

aatgcctccg gtgattggca ttgcgattcc acttggtcag agagccacgt caccaccacc 120 

tcaacccgca cctgggtcct gccgacctac aacaaccacc tgtacctgcg gctcggctcg 180 

agcaacgcca gcgacacctt caacggattc tccaccccct ggggatactt tgactttaac 240 

cgcttccact gccacttctc gccaagagac tggcaaaggc tcatcaacaa ccactgggga 300 

ctgcgcccca aaagcatgca agtccgcatc ttcaacatcc aagttaagga ggtcacgacg 360 

tctaacgggg agacgaccgt atccaacaac ctcaccagca cggtccagat ctttgcggac 420 

agcacgtacg agctcccgta cgtgatggat gcaggtcagg agggcagctt gcctcctttc 480 

cccaacgacg tgttcatggt gcctcagtac gggtactgcg gactggtaac cggaggcagc 540 

tctcaaaacc agacagacag aaatgccttc tactgtctgg agtactttcc cagccagatg 600 
ctgagaaccg gaaacaactt tgagatggtg tacaagtttg aaaacgtgcc cttccactcc 

atgtacgctc acagccagag cctggatagg ctgatgaacc cgctgctgga ccagtacctg 720 
tgggagctcc agtctaccac ctctggagga actctcaacc agggcaattc agccaccaac 
tttgccaagc tgaccaaaac aaacttttct ggctaccgca aaaactggct cccggggccc 
atgatgaagc agcagagatt ctccaagact gccagtcaaa actacaagat tccccaggga 
agaaacaaca gtctgctcca ttatgagacc agaactaccc tcgacggaag atggagcaat 

tttgccccgg gaacggccat ggcaaccgca gccaacgacg ccaccgactt ctctcaggcc 1020 

cagctcatct ttgcggggcc caacatcacc ggcaacacca ccacagatgc caataacctg 1080 

atgttcactt cagaagatga acttagggcc accaaccccc gggacactga cctgtttggc 1140 

cacctggcaa ccaaccagca aaacgccacc accgttccta ccgtagacga cgtggacgga 1200 

gtcggcgtgt acccgggaat ggtgtggcag gacagagaca tttactacca agggcccatt 1260 

tgggccaaaa ttccacacac ggatggacac tttcacccgt ctcctctcat tggcggattt 1320 

qqactgaaaa gcccgcctcc acaaatattc atcaaaaaca ctcctgtacc cgccaatccc 1380 

gcaacgacct tctctccggc cagaatcaac agcttcatca cccagtacag caccggacag 1440 

qtggctgtca aaatagaatg ggaaatccag aaggagcggt ccaagagatg gaacccagag 1500 

gtccagttca cgtccaacta cggagcacag gactcgcttc tctgggctcc cgacaacgcc 1560 

ggagcctaca aagagcccag ggccattgga tcccgatacc tcaccaacca cctctag ibl7 

<210> 57 
<211> 538 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 57 , _ ^ A n 

Met Arg Ala Ala Ala Gly Gly Asn Gly Gly Asp Ala Gly Gin Gly Ala 
1 5 10 15 



660 
720 
780 
840 
900 
960 
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Glu Gly val 

Ser Glu Ser 
35 

Thr Tyr Asn 
50 

Asp Thr Phe 

65 

Arg Phe His 

Asn His Trp 

lie Gin val 
115 

Asn Asn Leu 

130 
Leu Pro Tyr 
145 

Pro Asn Asp 



Gly Asn 
20 

His Val 

Asn His 

Asn Gly 

Cys His 

85 
Gly Leu 
100 

Lys Glu 
Thr Ser 
Val Met 



Ala ser Gly Asp Trp His cys Asp 

25 

ser Thr Arg 



val Phe 
165 

Thr Gly Gly ser Ser 

180 

Phe Pro 



Leu Glu Tyr 
195 

Met val Tyr 

210 
Ser Gin ser 
225 

Trp Glu Leu 

Ser Ala Thr 

Arg Lys Asn 
275 

Lys Thr Ala 

290 
Leu Leu His 
305 

Phe Ala Pro 

Phe Ser Gin 

Thr Thr Thr 
355 

Arg Ala Thr 

370 
Asn Gin Gin 
385 

Val Gly val 

Gin Gly Pro 

pro Ser Pro 
435 

lie Phe lie 

450 
ser Pro Ala 
465 

Val Ala val 
Trp Asn Pro 
Leu Leu Trp 



Lys Phe 

Leu Asp 

Gin ser 
245 
Asn Phe 
260 

Trp Leu 

ser Gin 

Tyr Glu 

Gly Thr 
. 325 
Ala Gin 
340 

Asp Ala 
Asn Pro 
Asn Ala 



Thr Thr Thr 
40 

Leu Tyr Leu 
55 

Phe ser Thr 
70 

Phe ser Pro 

Arg Pro Lys 

Val Thr Thr 
120 

Thr Val Gin 
135 

Asp Ala Gly 
150 

Met Val Pro 

Gin Asn Gin 

ser Gin Met 
200 

Glu Asn Val 
215 

Arg Leu Met 
230 

Thr Thr ser 



Thr Trp 
45 

Arg Leu Gly ser ser 

60 

Tyr Phe 



pro Trp Gly 
75 

Arg Asp Trp 
90 

Ser Met Gin 
105 

ser Asn Gly 

lie Phe Ala 

Gin Glu Gly 
155 

Gin Tyr Gly 

170 
Thr Asp Arg 
185 

Leu Arg Thr 
Pro Phe His 



Gin Arg 

val Arg 

Glu Thr 
125 
Asp ser 
140 

ser Leu 
Tyr Cys 
Asn Ala 



ser Thr Trp 
30 

val Leu Pro 

Asn Ala Ser 

Asp Phe Asn 
80 

Leu lie Asn 
95 

lie Phe Asn 
110 

Thr Val ser 



Thr Tyr 

pro Pro 

Gly Leu 
175 
Phe Tyr 
190 

Asn Phe 



Glu 

Phe 
160 
Val 

cys 

Glu 



Asn Pro Leu 
235 

Gly Gly Thr 
250 

Ala Lys Leu Thr Lys Thr 

265 

Met Met Lys 



Gly Asn 
205 

Ser Met Tyr Ala His 
220 

Leu Asp 



Pro Gly Pro 
280 

Asn Tyr Lys 
295 

Thr Arg Thr 
310 

Ala Met Ala 

Leu lie Phe 

Asn Asn Leu 
360 

Arg Asp Thr 
375 

Thr Thr val 
390 

Tyr Pro Gly Met val 
405 

lie Trp Ala Lys lie 
420 

Leu lie 



Lys Asn 

Arg lie 

Lys lie 
485 
Glu val 
500 

Ala Pro 



Gly Gly Phe 
440 

Thr Pro val 
455 

Asn ser Phe 
470 

Glu Trp Glu 



lie Pro Gin 

Thr Leu Asp 
315 

Thr Ala Ala 

330 
Ala Gly Pro 
345 

Met Phe Thr 

Asp Leu Phe 

pro Thr Val 
395 

Trp Gin Asp 

410 
Pro His Thr 
425 

Gly Leu Lys 
Pro Ala Asn 



lie Thr Gin 
475 

lie Gin Lys 
490 

Gin Phe Thr ser Asn Tyr 

505 

Asp Asn Ala Gly Ala Tyr 



Leu Asn 

Asn Phe 

Gin Gin 
285 
Gly Arg 
300 

Gly Arg 

Asn Asp 

Asn lie 

Ser Glu 
365 
Gly His 
380 

Asp Asp 

Arg Asp 

Asp Gly 

Ser Pro 
445 
Pro Ala 
460 

Tyr Ser 
Glu Arg 
Gly Ala 
Lys Glu 



Gin Tyr Leu 
240 
Gin Gly Asn 

255 
Ser Gly Tyr 
270 

Arg Phe Ser 

Asn Asn Ser 

Trp ser Asn 
320 

Ala Thr Asp 
335 

Thr Gly Asn 
350 

Asp Glu Leu 

Leu Ala Thr 

Val Asp Gly 
400 

lie Tyr Tyr 

415 
His Phe His 
430 

pro Pro Gin 

Thr Thr Phe 

Thr Gly Gin 
480 

Ser Lys Arg 

495 
Gin Asp ser 
510 

Pro Arg Ala 
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515 520 525 

lie Gly ser Arg Tyr Leu Thr Asn His Leu 
530 535 

<210> 58 
<211> 150 
<212> DNA 

<213> Artificial sequence 

»> -j n 

<223> Description of Artificial sequence; note = 
synthetic construct 

ataqcactcc cccccctgtc gcgttcgctc gttcgctggc tcgattgggg gggtggcagc 
tclaagagct gccagacgac ggccctctgg gccgtcgccc ccccaatcga gccagcgaac 
gagcgaacgc gacagggggg ggagtgccac 

<210> 59 
<211> 20 
<212> DNA 

<213> Artificial sequence 

^220^ 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 59 

ctctagcaag ggggttttgt 

<210> 60 
<211> 7 
<212> DNA 

<213> Artificial Sequence 
^220^ 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 60 
agtgtgg 

<210> 61 
<211> 158 
<212> DNA 

<213> Artificial sequence 
22 0^ 

<223> Description of Artificial sequence; note = 
synthetic construct 

aggtSgtgat gtcattgttg atgtcattat agttgtcacg cgatagttaa tgattaacag 
tcatgtgatg tgtgttatcc aataggatga aagcgcgcga atgagatctc gcgagacttc 
cggggtataa aaggggtgag tgaacgagcc cgccgcca 

<210> 62 
<211> 112 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence; note = 
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synthetic construct 

qgtggattct gggtatattc ccgcctacct gctgccgaag gtccaaccag agcttcagtg 60 

ggcgtggact aacctcgaag agtataaatt ggccgccctc aatctggagg ag ii£ 

<210> 63 
<211> 169 
<212> DNA 

<213> Artificial Sequence 

<220> „ . „. . _ 

<223> Description of Artificial Sequence; note = 

synthetic construct 

agtcaaagac ttttttgctt gggcaaaggt caaccaggtg ccggtgactc acgagtttat 60 

ggttcccaag aaagtggcgg gaactgagag ggcggagact tctagaaaac gcccactgga l£U 

tgacgtcacc aataccaact ataaaagtcc ggagaagcgg gcccggctc ioy 

<210> 64 
<211> 4721 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 64 „„„ an 

ttggccactc cctctatgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc bo 

agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcat agagggagtg 120 

gEcaactcca tcactagggg taccgcgaag cgcctcccac gctgccgcgt cagcgctgac 180 

gtaaatcacg tcatagggga gtggtcctgt attagctgtc acgtgagtgc ttttgcgaca 240 

ttttgcgaca ccacgtggcc atttgaggta tatatggccg agtgagcgag caggatctcc 300 

attttgaccg cgaaatttga acgagcagca gccatgccgg gtttctacga gatcgtgatc 360 

aaggtgccga gcgacctgga cgagcacctg ccgggcattt ctgactcgtt tgtgaactgg 420 

gtggccgaga aggaatggga gctgcccccg gattctgaca tggatctgaa tctgatcgag 480 

caggcacccc tgaccgtggc cgagaagctg cagcgcgact tcctggtcca atggcgccgc 540 

gtgagtaagg ccccggaggc cctgttcttt gttcagttcg agaagggcga gagctacttc 600 

caccttcacg ttctggtgga gaccacgggg gtcaagtcca tggtgctagg ccgcttcctg 660 

agtcagattc gggagaagct ggtccagacc atctaccgcg gggtcgagcc cacgctgccc 720 

aactggttcg cggtgaccaa gacgcgtaat ggcgccggcg gggggaacaa ggtggtggac 780 

gagtgctaca tccccaacta cctcctgccc aagacccagc ccgagctgca gtgggcgtgg 840 
actaacatgg aggagtatat aagcgcgtgt ttgaacctgg ccgaacgcaa acggctcgtg 
gcgcagcacc tgacccacgt cagccagacg caggagcaga acaaggagaa tctgaacccc 

iattctgacg cgcccgtgat caggtcaaaa acctccgcgc gctacatgga gctggtcggg 1020 

tggctggtgg accggggcat cacctccgag aagcagtgga tccaggagga ccaggcctcg 1080 

tacatctcct tcaacgccgc ctccaactcg cggtcccaga tcaaggccgc gctggacaat 1140 

gccggcaaga tcatggcgct gaccaaatcc gcgcccgact acctggtggg gccctcgctg 1200 

cccgcggaca ttaaaaccaa ccgcatctac cgcatcctgg agctgaacgg gtacgatcct 1260 

gcctacgccg gctccgtctt tctcggctgg gcccagaaaa agttcgggaa gcgcaacacc i^u 

atctggctgt ttgggcccgc caccaccggc aagaccaaca ttgcggaagc catcgcccac ii80 

gccgtgccct tctacggctg cgtcaactgg accaatgaga actttccctt caacgattgc 1440 

gtcgacaaga tggtgatctg gtgggaggag ggcaagatga cggccaaggt cgtggagtcc isoo 

gccaaggcca ttctcggcgg cagcaaggtg cgcgtggacc aaaagtgcaa gtcgtccgcc 1560 

cagatcgacc ccacccccgt gatcgtcacc tccaacacca acatgtgcgc cgtgattgac lb^o 

gggaacagca ccaccttcga gcaccagcag ccgttgcagg accggatgtt caaatttgaa 1680 

ctcacccgcc gtctggagca cgactttggc aaggtgacga agcaggaagt caaagagttc 1/40 

ttccgctggg ccagtgatca cgtgaccgag gtggcgcatg agttctacgt cagaaagggc 1800 

ggagccagca aaagacccgc ccccgatgac gcggatataa gcgagcccaa gcgggcctgc 1860 

ccctcagtcg cggatccatc gacgtcagac gcggaaggag ctccggtgga ctttgccgac iszo 

aggtaccaaa acaaatgttc tcgtcacgcg ggcatgattc agatgctgtt tccctgcaaa 1980 



900 
960 
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acgtgcgaga gaatgaatca gaatttcaac atttgcttca cacacggggt cagagactgt 2040 

ttagagtgtt tccccggcgt gtcagaatct caaccggtcg tcagaaaaaa gacgtatcgg 2100 

aaactctgcg cgattcatca tctgctgggg cgggcgcccg agattgcttg ctcggcctgc 2160 

gacctggtca acgtggacct ggacgactgc gtttctgagc aataaatgac ttaaaccagg 2220 

tatggctgcc gatggttatc ttccagattg gctcgaggac aacctctctg agggcattcg 2280 

cgagtggtgg gacctgaaac ctggagcccc gaaacccaaa gccaaccagc aaaagcagga 2340 

caacggccgg ggtctggtgc ttcctggcta caagtacctc ggacccttca acggactcga 2400 

caagggggag cccgtcaacg cggcggacgc agcggccctc gagcacgaca aggcctacga 2460 

ccagcagctc aaagcgggtg acaatccgta cctgcggtat aaccacgccg acgccgagtt 2520 

tcaggagcgt ctgcaagaag atacgtcatt tgggggcaac ctcgggcgag cagtcttcca 2580 

ggccaagaag cgggttctcg aacctctcgg tctggttgag gaaggcgcta agacggctcc 2640 

tgcaaagaag agaccggtag agccgtcacc tcagcgttcc cccgactcct ccacgggcat 2700 

cggcaagaaa ggccagcagc ccgccagaaa gagactcaat ttcggtcaga ctggcgactc 2760 

agagtcagtc cccgaccctc aacctctcgg agaacctcca gcagcgccct ctagtgtggg 2820 

atctggtaca gtggctgcag gcggtggcgc accaatggca gacaataacg aaggtgccga 2880 

cggagtgggt aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt 2940 

cattaccacc agcacccgaa cctgggccct gcccacctac aacaaccacc tctacaagca 3000 

aatctccagt gaaactgcag gtagtaccaa cgacaacacc tacttcggct acagcacccc 3060 

ctgggggtat tttgacttta acagattcca ctgccacttc tcaccacgtg actggcagcg 3120 

actcatcaac aacaactggg gattccggcc caagaagctg cggttcaagc tcttcaacat 3180 

ccaggtcaag gaggtcacga cgaatgacgg cgttacgacc atcgctaata accttaccag 3240 

cacgattcag gtattctcgg actcggaata ccagctgccg tacgtcctcg gctctgcgca 3300 

ccagggctgc ctgcctccgt tcccggcgga cgtcttcatg attcctcagt acggctacct 3360 

gactctcaac aatggcagtc agtctgtggg acgttcctcc ttctactgcc tggagtactt 3420 

cccctctcag atgctgagaa cgggcaacaa ctttgagttc agctacagct tcgaggacgt 3480 

gcctttccac agcagctacg cacacagcca gagcctggac cggctgatga atcccctcat 3540 

cgaccagtac ttgtactacc tggccagaac acagagtaac ccaggaggca cagctggcaa 3600 

tcgggaactg cagttttacc agggcgggcc ttcaactatg gccgaacaag ccaagaattg 3660 

gttacctgga ccttgcttcc ggcaacaaag agtctccaaa acgctggatc aaaacaacaa 3720 

cagcaacttt gcttggactg gtgccaccaa atatcacctg aacggcagaa actcgttggt 3780 

taatcccggc gtcgccatgg caactcacaa ggacgacgag gaccgctttt tcccatccag 3840 

cggagtcctg atttttggaa aaactggagc aactaacaaa actacattgg aaaatgtgtt 3900 

aatgacaaat gaagaagaaa ttcgtcctac taatcctgta gccacggaag aatacgggat 3960 

agtcagcagc aacttacaag cggctaatac tgcagcccag acacaagttg tcaacaacca 4020 

gggagcctta cctggcatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg 4080 

ggccaagatt cctcacacgg atggcaactt tcacccgtct cctttgatgg gcggctttgg 4140 

acttaaacat ccgcctcctc agatcctgat caagaacact cccgttcccg ctaatcctcc 4200 

ggaggtgttt actcctgcca agtttgcttc gttcatcaca cagtacagca ccggacaagt 4260 

cagcgtggaa atcgagtggg agctgcagaa ggaaaacagc aagcgctgga acccggagat 4320 

tcagtacacc tccaactttg aaaagcagac tggtgtggac tttgccgttg acagccaggg 4380 

tgtttactct gagcctcgcc ctattggcac tcgttacctc acccgtaatc tgtaattgca 4440 

tgttaatcaa taaaccggtt gattcgtttc agttgaactt tggtctcctg tgcttcttat 4500 

cttatcggtt tccatagcaa ctggttacac attaactgct tgggtgcgct tcacgataag 4560 

aacactgacg tcaccgcggt acccctagtg atggagttgg ccactccctc tatgcgcgct 4620 

cgctcgctcg gtggggcctg cggaccaaag gtccgcagac ggcagagctc tgctctgccg 4680 

gccccaccga gcgagcgagc gcgcatagag ggagtggcca a 4721 

<210> 65 
<211> 623 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 65 

Met Pro Gly Phe Tyr Glu lie Val lie Lys Val Pro Ser Asp Leu Asp 

15 10 15 

Glu His Leu Pro Gly lie ser Asp Ser Phe Val Asn Trp val Ala Glu 

20 25 30 

Lys Glu Trp Glu Leu Pro Pro Asp ser Asp Met Asp Leu Asn Leu lie 
35 40 45 
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Glu Gin Ala Pro Leu Thr val Ala Glu Lys Leu Gin Arg Asp Phe Leu 

50 55 60 

val Gin Trp Arg Arg val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 " 70 75 80 

Gin Phe Glu Lys Gly Glu ser Tyr Phe His Leu His Val Leu val Glu 

85 90 95 

Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu ser Gin lie 

100 105 110 

Ara Glu Lys Leu Val Gin Thr lie Tyr Arg Gly val Glu Pro Thr Leu 

y 115 120 125 

Pro Asn Trp Phe Ala val Thr Lys Thr Arg Asn Gly Ala Gly Gly Gly 

130 135 140 

Asn Lys val val Asp Glu cys Tyr lie Pro Asn Tyr Leu Leu Pro Lys 
145 150 155 160 

Thr Gin pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu Tyr lie 

165 170 175 

ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala Gin His 

180 185 190 

Leu Thr His val ser Gin Thr Gin Glu Gin Asn Lys Glu Asn Leu Asn 

195 200 205 

pro Asn Ser Asp Ala Pro val lie Arg ser Lys Thr Ser Ala Arg Tyr 

210 215 220 

Met Glu Leu val Gly Trp Leu val Asp Arg Gly lie Thr Ser Glu Lys 
225 230 235 240 

Gin Trp lie Gin Glu Asp Gin Ala ser Tyr lie ser Phe Asn Ala Ala 

245 250 255 

Ser Asn Ser Arg ser Gin lie Lys Ala Ala Leu Asp Asn Ala Gly Lys 

260 265 270 

lie Met Ala Leu Thr Lys ser Ala Pro Asp Tyr Leu Val Gly Pro ser 

275 280 285 

Leu Pro Ala Asp lie Lys Thr Asn Arg lie Tyr Arg He Leu Glu Leu 

290 295 300 

Asn Gly Tyr Asp Pro Ala Tyr Ala Gly ser Val Phe Leu Gly Trp Ala 
305 310 315 320 

Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly Pro Ala 

325 330 335 

Thr Thr Gly Lys Thr Asn lie Ala Glu Ala He Ala His Ala val Pro 

340 345 350 

Phe Tyr Gly cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe Asn Asp 

355 360 365 

cys val Asp Lys Met Val lie Trp Trp Glu Glu Gly Lys Met Thr Ala 

370 375 380 

Lys val val Glu ser Ala Lys Ala lie Leu Gly Gly ser Lys Val Arg 
385 390 395 , 400 

Val Asp Gin Lys cys Lys ser Ser Ala Gin He Asp Pro Thr Pro val 

405 410 415 

lie val Thr ser Asn Thr Asn Met cys Ala Val lie Asp Gly Asn ser 

420 425 430 

Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe Lys Phe 

435 440 445 

Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys val Thr Lys Gin 

450 455 460 

Glu val Lys Glu Phe Phe Arg Trp Ala ser Asp His val Thr Glu val 
465 470 475 480 

Ala His Glu Phe Tyr val Arg Lys Gly Gly Ala Ser Lys Arg Pro Ala 

485 490 495 

Pro Asp Asp Ala Asp lie ser Glu Pro Lys Arg Ala cys Pro ser Val 

500 505 510 

Ala Asp Pro ser Thr ser Asp Ala Glu Gly Ala Pro val Asp Phe Ala 

515 520 525 

Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met He Gin Met 

530 535 540 

Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe Asn He 
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545 550 555 n 560 

Cvs Phe Thr His Gly val Arg Asp cys Leu Glu Cys Phe Pro Gly Val 

565 570 575 

Ser Glu ser Gin Pro Val val Arg Lys Lys Thr Tyr Arg Lys Leu cys 

580 585 590 

Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys ser Ala 

595 600 605 

cys Asp Leu val Asn val Asp Leu Asp Asp cys val ser Glu Gin 
610 615 620 

<210> 66 
<211> 737 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 66 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 

1 5 10 15 

Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 

20 25 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 

35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 

50 55 60 

val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 . 80 

Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 

85 90 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr ser Phe Gly Gly 

P 100 105 HO 

Asn Leu Gly Arg Ala val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 

115 120 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 

130 135 140 

pro val Glu Pro ser Pro Gin Arg ser Pro Asp ser ser Thr Gly lie 
145 150 155 160 

Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 

165 170 175 

Thr Gly Asp ser Glu ser val Pro Asp Pro Gin Pro Leu Gly Glu Pro 

180 185 1?0 

pro Ala Ala Pro ser ser val Gly ser Gly Thr val Ala Ala Gly Gly 

195 200 205 

Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly val Gly Asn 

210 215 220 

Ala Ser Gly Asn Trp His cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 

lie Thr Thr ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 

245 250 255 

Leu Tvr Lys Gin lie ser ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 

y 7 260 265 270 

Thr Tyr Phe Gly Tyr ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 

275 280 285 

Phe His cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 

290 295 300 

Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie 
305 310 315 320 

Gin val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn 

325 330 335 

Asn Leu Thr Ser Thr lie Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu 
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340 345 350 

Pro Tvr Val Leu Gly Ser Ala His Gin Gly cys Leu Pro Pro Phe Pro 

355 360 365 

Ala asp val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 

370 375 380 

Gly ser Gin Ser Val Gly Arg ser ser Phe Tyr cys Leu Glu Tyr Phe 
385 390 395 400 

Pro ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe ser Tyr Ser 

405 410 415 

Phe Glu Asp val Pro Phe His Ser ser Tyr Ala His ser Gin ser Leu 

420 425 430 

Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ala 

435 440 445 

Ara Thr Gin ser Asn Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin 

y 450 455 460 

Phe Tvr Gin Gly Gly Pro Ser Thr Met Ala Glu Gin Ala Lys Asn Trp 
465 470 475 , 480 

Leu Pro Gly Pro cys Phe Arg Gin Gin Arg val ser Lys Thr Leu Asp 

485 490 495 

Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 

500 505 510 

Leu Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 

515 520 525 

His Lys Asp Asp Glu Asp Arg Phe Phe Pro ser ser Gly val Leu He 

530 535 540 

Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn val Leu 
545 550 555 560 

Met Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro val Ala Thr Glu 

565 570 575 

Glu Tyr Gly lie val Ser Ser Asn Leu Gin Ala Ala Asn Thr Ala Ala 

580 585 n 590 

Gin Thr Gin val val Asn Asn Gin Gly Ala Leu Pro Gly Met Val Trp 

595 600 605 

Glh Asn Arg Asp val Tyr Leu Gin Gly Pro lie Trp Ala Lys He Pro 

610 " 615 620 

His Thr Asp Gly Asn Phe His Pro ser Pro Leu Met Gly Gly Phe Gly 
625 630 635 640 

Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro 

645 650 n 655 

Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala ser Phe He 

660 665 n , 670 

Thr Gin Tyr ser Thr Gly Gin val ser val Glu lie Glu Trp Glu Leu 

675 680 685 

Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu lie Gin Tyr Thr Ser 

690 695 700 

Asn Phe Glu Lys Gin Thr Gly Val Asp Phe Ala val Asp Ser Gin Gly 
705 710 715 720 

Val Tyr Ser Glu pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn 

725 730 735 

Leu 



<210> 67 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note 
synthetic construct 

<400> 67 

Gly Ser Ser Asn Ala ser Asp Thr 
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<210> 68 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 68 

Thr Thr ser Gly Gly Thr Leu Asn Gin Gly Asn Ser Ala Thr 
1 5 10 

<210> 69 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 69 

Asn Gly Arg Ala His Ala 
1 5 

<210> 70 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 70 

Ser lie Gly Tyr Pro Leu Pro 
1 5 

<210> 71 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 

<400> 71 

Lys Phe Asn Lys Pro Phe val Phe Leu lie 
15 10 

<210> 72 
<211> 22 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence; note = 
synthetic construct 
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<400> 72 . 

Asn lie ser Leu Asp Asn Pro Leu Glu Asn pro Ser ser Leu Pne Asp 
15 10 15 

Leu Val Ala Arg lie Lys 

20 
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